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5 GENE TARGETING METHOD 

CROSS REFERENCE TO RELATED APPLICATIONS 

10 This application claims priority from U.S. Provisional Patent Application Nos. 

60/258,682 filed December 28, 2000, 60/188,672, filed March 13, 2000, and 60/187,220, 
filed March 3, 2000. 
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BACKGROUND OF THE INVENTION 

20 

When exogenous DNA or RNA is introduced into a cell, the cell is said to be transformed. 
Various methods are known by which the transforming nucleic acid becomes a permanent part of 
the transformed cell's genome. Unless specialized methods are used, permanent transformation 
is usually the result of integration of the transforming nucleic acid in chromosomal DNA at a 

25 random location. The transforming DNA can also be introduced into the cell on a plasmid that 
replicates autonomously within the cell and which segregates copies to daughter cells when the cell 
divides. Either way, the locus of die transforming nucleic acid with respect to endogenous genes 
of the cell is unspecified. Gene targeting is the general name for a process whereby chromosomal 
integration of the transforming DNA at a desired genetic locus is facilitated, to the extent that 

30 permanently transformed cells having the DNA at that locus can be obtained at a useful frequency. 
Typically, the gene at the target locus is modified, replaced or duplicated by the transforming 
(donor) nucleic acid. 
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integration events that have occurred (or select against undesired integration events). Without 
such steps, the desired integration might occur by chance, but with such a low frequency as 
to be undetectable. 

Yeast (Saccharomyces cerevisiae) has been a useful organism for development of gene 
targeting methods. Rothenstein, R. (1991) Methods in Enzymology 194:281-301 reviewed 
techniques of targeted integration in yeast The normal yeast process of homologous 
recombination was shown to permit integration of transforming plasmid DNA having a 
segment of sequence homologous to a yeast gene. When a double-strand break was introduced 
within a homologous segment, transformation with the resulting linear DNA resulted in a 10- 
1000-fold increased incidence of integration at or near the break The longer the region of 
homology on eftherside of the break, the greater the frequency of recombination at me desired 
locus. Strategies for gene replacement, gene disruption and rescue of mutant alleles were 
described. 



The studies of gene targeting in yeast have been facilitated by the fact that individual 
transformed cells can be isolated and grown in pure culture to any convenient amount In 
addition, the short doubling time of yeast cells in culture has allowed researchers to observe 
events that occur with a low frequency and to study the genetics of those events within a 

20 convenient time scale. When working with complex multiceUular organisms, the number of 
individuals which can be assessed for a genetic change, and the time scale required for 
observing patterns of inheritance are both increased. To achieve practical gene targeting in 
such organisms, techniques were developed to increase the frequency of observable targeting 
events and to hicrease the efficfemy of selection for desh^ Practical methods of gene 

25 targeting have been developed in the fruit fly, Drosophila melanogaster, and in the mouse, 
Mas musculo, however such methods have not been applicable to a wider range of organisms \ 

Transposons have been utilized for inducing gene targeting m Drosophila. 
Gloor, G.B., et al. (1991) Science, 253:1110-1117 described utilizing the property of the P 
30 element transposon to generate a double strand gap when a transposition event occurs, the gap 
being located at the site formerly occupied by the transposon. Under most circumstances the 
resulting gap is repaired by copying from homologous sequences on the sister chromatid. If 
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a homologous sequence is present in the cell at an ectopic locus, for example on a plasmid, that 
sequence can also serve as a template to repair the double strand gap generated by the 
transposon's departure. This type of gap repair can then be employed to target a desired 
sequence to the locus of the departing transposon. , The primary limitation of the process is that 
5 the host organism must have a transposon located at or near the target site. 

The FLP-FRT recombinase system of yeast was employed to mobilize PRT-flanked 
donor DNA and generate re-integration at a different chromosomal location (Golic, M.M. , et 
al, (1997) NucL Acids Res. 25:3665-3671). The donor DNA was introduced into the 

10 Drosophila chromosome flanked by repeats of the FRT recombinase recognition site, all within 
a P element for integration. The FLP recombinase was introduced under control of a heat- 
shock promoter, so that the enzyme could be activated by the investigators at a specified time. 
The action of FLP recombinase could result in excision of the donor DNA followed by a 
second round of recombination at a target site where another FRT site was present. The 

15 phenomenon could be observed by using flies having the target FRT site at the locus of a 
known gene where an altered phenotype was detectable. 

Gene targeting in mammals has only been achieved to any significant degree in the mouse. 
Uniquely in the case of the mouse, a pluripotent cell line exists, embryonic stem (ES) cells that 

20 can be grown in culture, transformed, selected and introduced into an embryonic stage, the 
blastocyst stage of the mouse embryo. Embryos bearing inserted transgenic ES cells develop as 
genetically chimeric offspring. By interbreeding siblings, homozygous mice carrying the 
selected genes can be obtained. An overview of the process and its limitations is provided by 
Capecchi, M. R, (1989) Trends in Genetics 5:70-76; and by Bronson, S.K. (1994) /. Biol 

25 Chem. 269: 27155-25158. Both homologous and non-homologous recombination occur in 
mammalian cells. Both processes occur with low frequency and non-homologous recombination 
occurs more frequently than homologous recombination. ES cells are transfected with a DNA 
construct that combines a donor DNA having the modification to be introduced at the target site 
combined with flanking sequence homologous to the target site, and marker genes, as needed, 

30 for selection, as well as any other sequences that may be desired. The donor construct need not 
be integrated into the chromosome initially, but can recombine with the target site by 
homologous recombination or at a non-target site by non-homologous recombination. Since 
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these even* are rare, dual selection is -quired to select for reco^inan^ and to selea ag^ 
non^logousrec^inants. TheselecdcH.arec^om^^ontbeEScellsinc^ 
PGR screening can also be employed to identify desired recombinants. The frequency of 
homologous recombinationis increased as the length of the region of homology m 
-^.wimatleast^ofhomologybeingprefe^ However homologous recombination 
has been observed with as little as 25-50bp of homology. Donor DNA having small deletions 
or mserdons of the target sequence are mtrc^ 

pomt mutations. Both insertions of sequence and replacement of the target, as well as 
duplication in whole or in part of the target can be accomplished, by appropriate design of the 
donor vector and the selection system, as desired for the purpose of the targeting. 

Gene targeting in mammals other than the mouse has been limited by lack of ES cells 
capable of being transplanted and of contributing to germ line cells of developing embryos 
However techniques related to cloning teehmlogy have open^ 
targeting to other species. McCreatfa, K.J., et al (2000) Nature 405:1066-1069 have reported 
successful targetinginsheep by carrying out transform^ 

embryo fibroblast cells. The targeted fibroblast nuclei were then transferred to enucleated egg 
cells followed by implantation in the uterus of a host mother. The technique provides the 
advantage that the generation of clnmeric animals and subsequent breeding to homozygosity 
are not required. However the time available for carrying out targeting and selection is short 



The use of recombinases and their recognition sites has proven to be a valuable tool 
once the iuitial targeting event has been achieved. For a review of the techniques applying the 
site specffic recombinase systems, see Sauer, B. et al, (1994) Current Opinion in Biotech 

25 5:521-527. See also U.S. Patent 4,959,317. For example, repeated targeting at a given locus 
is facilitated by including recombination-specific recombination sites in the initial targeting 
construct. Once in place, the recombination sites can be used, in combination with their 
respective recombinase, to provide highly efficient transfer of anexogenous DNA to the locus 
of the recombination site. A recombinase system commonly used is the Cre recombinase 

30 which recognizes a sequence designated loxP. The Cre recombinase and Precognition site 
are derived from bacteriophage PI. Another widely used system, derived from the 2u circle 
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of Saccharomyces cerevisiae, istheFLP recombinase which recognizes a specific sequence, 
FRT. In both systems, the effect of recombinase activity is determined by the orientation of 
the recognition sites flanking a given segment of DNA. A DNA sequence flanked by directly 
repeated recombination sites and then integrated into the genome by either homologous or 

5 megihmate recombination can subsequently be removed simply by providing the corresponding 
recombinase. One useful consequence of this property has been exploited to remove an 
unwanted selection marker from the target site once homologous recombination has occurred 
and selection is no longer necessary. In another application, a gene which may exert a toxic 
effect can be maintained in a dormant state by inserting a fox-flanked sequence between the 

10 promoter and the gene, the sequence being designed to prevent expression of the gene. 
Expression of Cre activity results in excision of the intervening sequence and allows to 
promoter to act to activate the dormant gene. Cre can be introduced by mating or provided in 
an inducible form that permits activation at the investigator's control. A variety of other post- 
targeting strategies can be facilitated by the use of site specific recombination systems, as 

15 known in the art. 

As has been shown in yeast, introducing a ds break into DNA increases reconciliation 
frequency. A number of studies have demonstrated that introducing a ds break into a target 
^ increased recombination with a homologous donor DNA about 100-fold. The ds break 
20 was created by providing an I-Scel site in the target DNA, then introducing and expressing 
an I-Scel endonuclease along with a donor DNA homologous to the target. Using Chinese 
hamster ovary (CHO) cells, Sargent, R.G. et al (1997) Mol. Cell. Biol. 17:267-277 described 
an experiment for testing crossovers between tandem repeats of an APRT gene, one of which 
carried anl-Scel site. The occurrence of homologous recombmation «>uld be measured by 
25 crossovers between the tandem APRT loci, which eliminated an intervening thynudine kinase 
CTk + ) gene, or within different segments of the APRT gene itself, based on the presence or 
" absencemmeprogeny.ofcertammnn^ 

was generated at the I-Scel site by introducing and expressing the I-Scel endonuclease carried 
on a separate expression vector and introduced by transformation. A similar type of 
30 demonstration was reported by Liang, F. et al (1998) Proc. Natl. Acad. Sci. USA 95:5172- 
5177. Cohen-Tannoudji, M. et al (1998) Mol. Cell. Biol. 18:1444-1448 described the use 
ofanl-Scelsitemtroduced^ Once in place, other 
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constructs could be introduced at the same target ("knocked in*) by a subsequent 
transformation with a desired donor construct and transient expression of I-Scel endonuclease 
to introduce a ds-break at the target The efficiency of the second targeting step was 
reportedly 100-fold greater than was observed for conventional targeting. The method had 
5 the disadvantage that an I-Scel site was required at the target site. 



U.S. Patent 5,962,327 describes the I-Scel endonuclease and its recognition site. The 
patent also discloses general strategies using I-Scel that can be attempted for the site-specific 
insertion of a DNA fragment from a plasmid into a chromosome. A diagram of site-directed 
10 homologous recombination in yeast is presented. It should be noted that this technique was 
shown only in yeast. 

In plants, spontaneous homologous recombination events have been characterized as 
"extremely rare" (Puchta, H. (1999) Gfenerfcy 199:1173-1181). Introduction of ds-breaks has 

15 been shown to increase the homologous recombination frequency. Puchta, H. et al (1996) 
Proc. Natl Acad. ScL USA 93:5055-5060 reported introducing (by T-DNA mediated 
transformation) a target locus bearing an I-Scel site and a partial kanamycin resistance gene. 
Ina second round of transformation, a repair construct was introduced along with an I-Scel 
expression cassette. Homologous recombination to restore kanamycin resistance was detected 

20 by the presence of kanamycjn-resistant callus cells. 

SUMMARY OF THE INVENTION 

The present invention includes methods and compositions for carrying out gene 
25 targeting. Unlike previously known methods for gene targeting in multicellular organisms, the 
present invention does not depend on availability of a pluripotential cell line, and hence can 
be adapted for gene targeting in any organism. The method exploits homologous 
recombination processes that are endogenous in the cells of all organisms. Any gene of an 
organism can be modified by the method of the invention as long as the sequence of the gene, 
30 or a portion of the gene, is known, or if a DNA clone is available. 
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"Target" is the term used herein to identify the genetic element or DNA segment to be 
modified. "Donor" is used herein to identify those genetic elements or DNA segments used 
to modify the target The modification can be any sort of genetic change, including 
substitution of one segment for another , insertion of single or multiple nucleotide replacements , 
5 deletion, insertion, duplication of all or part of the target, and combinations thereof. 

In general outline, a donor construct is provided within cells of the organism. The 
donor construct can be integrated anywhere in the genome, without regard to the locus of die 
target Alternatively, the donor construct can be carried on an autonomously replicating 

10 genetic element, or present transiently. The donor construct includes a version of the target, 
the target modifying sequence, containing any sequence modifications to be introduced at the 
target site and also having a unique endonuclease site. Action of an endonuclease able to 
recognize the unique site results in a double strand break within die modifying sequence, 
generating a recombinogenic donor. Prior to, or in combination with, generating the double 

15 strand break, the donor construct is excised from its locus of integration, by various means 
described hereinafter. The combination of the excision and endonuclease cutting frees the 
recombinogenic donor to undergo homologous recombination at the target site resulting in the 
desired genetic change at the target. If the donor construct is not chromosomally integrated, 
but merely present on a plasmid in the host cell, the excision step is not needed. As described 

20 herein, the use of various selectable markers at specified positions of the donor construct 
relative to the modifying sequence facilitates identifying recombinants and selecting for the 
desired type of recombinant. 

The timing of the excision and endonuclease steps is controlled by maintaining the 
25 enzymes that catalyze these reactions under inducible or tissue-specific expression control. 
The genes encoding the enzymes combined with their promoters or mRNA encoding the 
enzymes or the enzymes themselves can be introduced to the organism concomitantly with the 
donor construct. Alternatively, a transgenic strain of the organism carrying the genes can be 
provided by a prior step of transformation and selection. Such a strain is termed herein a 
30 carrier host organism. A carrier host organism is useful as a host for all desired target gene 
modifications of the host species. 
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Many alterations and variations of the invention exist as described herein. The 
invention is exemplified for gene targeting in the insect, Drosophita, and in the plant, 
Arabidopsis. In both these organisms nucleotide sequences are known for most of the genome. 
Increasingly larger segments of genomic sequences are becoming known for a growing number 
5 of organisms. The functional elements used to carry out the steps of the invention are known 
for any desired organism. Therefore the present invention can be adapted for application in 
any organism. The invention therefore provides a general method for gene targeting in any 
organism, as well as a method for making a carrier host strain of any organism. Products of 
the invention include transformation vectors for gene targeting that include a modifying 
10 sequence having a unique, endomiclease recognition site associated therewith such that 
endonuclease cutting at the site yields a recombinogenic donor. The invention also provides 
a transformation vector for generating a carrier host organism including an endonuclease 
capable of making double strand break in DNA at the unique site, the endonuclease being 
under control of an inducible promoter. 

15 

DESCRIPTION OF THE DRAWINGS 

Figure 1 is a diagram demonstrating I-Scel cutting efficiency (Example 1). The 
reporter constructs were transformed via P elements (indicated by small arrowheads), and 
20 carried the I-Scel cut site (as indicated) either (A) adjacent to a shortened version of the wild 
type w + gene (indicated by the large solid arrow), or (B) flanked by a complete copy and a 
non-functional partial copy of that w + gene. The complete gene is -4.5 kb in length and the 
non-functional partial gene is -3.5 kb. 

25 Figure 2 is a diagram showing the construct for yellow targeting. At the top is 

diagramed the donor construct (Pjy-donor]) as it would appear in the chromosome when 
initially transformed via P element transformation. Diagramed beneath that is the form of the 
extrachromosomal donor DNA after FLP-mediated excision and I-Scel cutting. The arrow 
indicates transcriptional direction of yellow. Cut site: 18 bp I-Scel recognition sequence, 

30 p2t:p2t tubulin gene.p3t coding region of P3 tubulin gene. S: restriction site for Sail. 
Underlines indicate the DNAs used as probes for chromosome in situ hybridization and 
Southern blot analyses. 

8 
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Figure 3 is a diagram of gene targeting configurations. Two typical forms of gene 
targeting constructs are shown, and the results of their recombination with the target locus . 

Figure 4 is a diagram of crossing schemes for yeltow rescue (Example 2). 

Figure 5 shows cytologic*! localization of a targeted insertion. The cytological 
of P2t hybridization are indicated on me chromosomes of this //y + Class HI female. 



10 Figure 6 is a diagram showing types of targeting events . The four classes of recovered 

targeting events are shown, with the likely mechanism of origin for each indicated at the left, 

and me product of each event at the right. The donor construct is diagramed as in Figure 2. 

The approximate position of the point mutation in/ is indicated by an asterisk. The expected 

sizes of the DNA fragments produced by Sail digestion are shown below each product at the 
15 right the presumed allelomorphs of y are inmcated above each copy of the gene. The 

approximate locations of the insertions (V) and deletions (A) found in Class m events are 

indicated. 

Figure 7 provides results of Southern blot analyses of targeting events. Roman 
20 numerals indicate thetype of targeting event by class type. Lanes 1 and 13 are controls: CI 
is DNA from/ males; C2 is DNA from/ males that also carry the donor construct shown 
in Figure 2. 

Figure 8 is a diagram of gene knock-out by targeting with a truncated gene. The donor 
25 DNA used for targeting consists of a truncated gene, missing portions at both the 5' and the 
3- ends. Donor integration disrupts the endogenous gene by sphtting it into two pieces, each 
having a deletion of a different part of the gene. 

Figure 9 is a diagram of a two-step method for introducing a mutation into a target 
30 zone. I-Crel is a rare-cutting endonuclease. 
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Figure 10 is a diagram of a donor construct for gene faceting in plants transformed via 
T-DNA. "kanR" denotes a kanamycin resistance marker gene. "GFP" is a green fluorescent 
protein marker gene. 



Figure 1 1 is a diagram of a donor construct designed for targeting using a transposase 
to excise the recombinogenic donor. 

Figure 12 is a diagram of a donor construct designed for carrying out the steps of the 
invention using a recombinase and a transposase. 

Figure 13 is a diagram of a donor construct designed for carrying out the invention 
using a transposase and a site-specific endonuclease. 

Figure 14 shows pug targeting mechanism. The extrachromosomal targeting molecule 
produced by FLP excision and iSed cutting is shown at the top. The endogenous^ locus 
is shown in the middle with the direction of transcription being from left to right. The genomic 
structure resulting from homologous recombination is depicted at the bottom. The probe used 
in Southern blot analysis (Figure 15) and selected restriction fragments are shown with sizes 
indicated in kb. Restriction sites are R: EcoRI, B: BamHL 

Figure 15 shows Southern blot analysis of a p«g targeting event. Fly DNA was 
digested with EcoRI and BamHL The membrane was hybridized with a 2.5 kb pug probe 
(Figure 14). Dine 1: molecular markers with indicated sizes. Lane 2: pug* control showing 
the endogenous 9 kb band. Lane 3: DNA from flies homozygous for the targeted^ allele 
showing, as predicted, the 7 kb and the 10 kb fragments. 

Figure 16 is a diagram showing steps for generating a null mutation of a Target Gene 
(TG). The top line shows both the donor construct, shown as a loop having ate gene, an I-Crel 
site (Q, a first flanking homologous segment (FH-1) shown with a gap to indicate an I-Scel she, 
and a second flanking homologous region (FH-2) aligned with a segment of the genome, showii 
as a straight line having TG flanked by FH-1 and FH-2. The second line diagrams the structure 
after I-Scel cutting and homologous recombination in the FH-1 region. The third line diagrams 
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an alignment of segments of the structure of line two after I-Crel cutting. The bottom line 
diagrams the resulting genomic structure after homologous recombination within FH-2. 

Figure 17 is a diagram of a donor construct (top line) structured for ends-in targeting 
using a combination of transposase and unique endonuclease. Transposase-recognizable 
inverted repeats (ER), I-Scel site (I), target gene modifying sequences (TGMS) and selectable 
marker gene (SMG) are identified. The bottom line shows the alignment of the 
recombinogenic donor and the target after transposase and endonuclease action. 

Figure 18 is a diagram of targeting using a donor construct (top line) having two I-Scel 
sites (I) but no recombinase or transposase recognition sites. Other abbreviations as in Figure 
17. DR = direct repeat 

Figure 19 is a diagram of targeting by the ends-out method through y 7 rescue. 

Figure 20 is a diagram of ends-out replacement. 

Figure 21 is a diagram of the targeting vector pTV2. 

Figure 22 is a diagram showing a simplified targeting screen. 

Figure 23 is a diagram of a crossing scheme used to eliminate the mapping and marking 
steps as a prerequisite for targeting. 

Figure 24 is a diagram showing that the stable traasfonnant step can be bypassed and 
somatic cell nuclei can be used to generate clones: yellow + clones in somatic cells of flies 
after coinjection of yellow donor DNA and I-Scel encoding mRNA. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods and compositions for carrying out gene 
targeting. In contrast previously known methods for gene targeting in multicellular organisms, 
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the present invention does not depend on availability of a pluripotential cell line, and is 
adaptable to any organism. Any gene of an organism can be modified by the method as the 
method exploits homologous recombination processes that are endogenous in the cells of all 
organisms. 

The methods of gene targeting of the invention fell into two general categories which 
both rely on homologous recombination: (A) the release only method, and (B) the release and 
cut method. Both methods involve the transformation of an organism with a donor construct 
of the invention. The release only method can be implemented through a variety of 
embodiments, including but not limited to , flanking a target gene and optional marker gene(s) - 
in the donor construct with (1) transposons, (2) rare-cutting endonuclease sites, and (3) a 
transposon and rare-artting endonuclease site. The release and cut method can be implemented 
through a variety of embodiments, including but not limited to, flanking a target gene and 
optional marker gene(s) in the donor construct with (1) site-specific recombinase target sites 
and cutting with a rare-cutting endonuclease, and (2) site-specific recombinase target sites and 
cutting with transposons. Other schemes based on these general concepts are within the scope 
and spirit of the invention, and are readily apparent to those skilled in the art 

The following terms are used herein according to the following definitions. 

"Gene targeting" is a general term for a process wherein homologous recombination 
occurs between DNA sequences residing in the chromosome of a host cell or host organism 
and a newly, introduced DNA sequence. 

"Host organism" is the term used for the organism in which gene targeting according 
to the invention is carried out. 

"Target" refers to the gene or DNA segment subject to modification by the gene 
targeting method of Ae present invention Normally, the target is an endogenous gene, coding 
segment, control region, intron, exon, or portion thereof, of the host organism. The target can 
be any part or parts of genomic DNA. 
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"Target gene modifying sequence" is a DNA segment having sequence homology to the 
target but differing from the target in certain ways, in particular with respect to the specific 
desired modification(s) to be introduced in the target. 

5 "Unique endonuclease site" is a recognition site for an endonuclease that catalyzes a 

double strand break in DNA at the site. Any recognition site that does not otherwise exist in 
the host organism, or does not exist at a site where double-strand breakage is harmful to the 
host organism, can serve as a unique endonuclease site for that organism. "Unique" is 
therefore an operational term. Furthermore, modified host organisms may be generated in 

10 which an endogenous site or sites have been modified so that they are no longer recognized by 
the endonuclease. Such a modified host organism can be generated by expressing the 
endonuclease in the organism and selecting for individuals that are resistant to harmful effects 
of such expression. Such resistant individuals can arise by cutting followed by inaccurate 
repair of the break and consequent alteration of the recognition sequence. Alternatively, within 

IS a population of individuals, pre-existing polymorphisms may already exist and be selected for 
by expression of the endonuclease. Many classes of enzymes catalyze double-strand DNA 
breakage in a site-specific manner, identified by a specific nucleotide sequence at or near the 
break point. Such enzymes include, but are not limited to transposases, recombinases and 
homing endonucleases. By introducing the nucleotide sequence of a unique endonuclease site 

20 into a donor construct, a double-strand break can be generated at or near that site by action of 
the appropriate endonuclease. A preferred class of unique endonuclease sites of practical 
utility are the homing endonuclease or rare-cutting endonuclease sites. The rare-cutting 
endonuclease sites are typically much longer than restriction endonuclease sites, usually ten or 
more base pairs in length and thus occur rarely, if at all, in a given host organism. For a 

25 review of the rare-cutting endonucleases and details of their recognition site sequences see 
Belfort, M., et al, (1997) Nucl. Acids Res. 25:3379-3388, incorporated herein by reference. 
Some of the rare-cutting endonucleases are encoded by organelle genomes, and the coding 
sequences may use non-standard coding. The coding sequences of many such endonucleases 
are known and have, or can be, modified to be expressible from a chromosomal locus. The 

30 expression can be controlled, if desired, by an inducible promoter. In principle, any rare- 
cutting endonuclease can be employed in the practice of the invention, including, for example 
I-Crel, I-Scel, I-Tli, I-Ceul, I-Ppol and PI-PspI. 
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"Marker" is the term used herein to denote a gene or sequence whose presence or 
absence conveys a detectable phenotype of the organism. Various types of markers include, 
but are not limited to, selection makers, screening markers and molecular markers. Selection 
5 markers are usually genes that can be expressed to convey a phenotype that makes the 
organism resistant or susceptible to a specific set of conditions. Screening markers convey a 
phenotype that is a readily observable and distmguishable trait. Molecular markers are 
sequence features that can be uniquely identified by ofigonucleotide probing, for example 
RFLP (restriction fragment length polymorphism), SSR markers (simple sequence repeat) and 
10 the like. 

"Donor construct" is the term used herein to refer to the entire set of DNA segments 
toteintrodijcediirtotheho^ 

sequences), one or more unique endonuclease sites, one or more markers, and optionally one 
15 or more recombinase target sites as well as other DNA segments as desired. In one 
embodiment of the invention, the donor construct is flanked by transposon target sites so that 
the donor construct becomes integrated somewhere in the host genome after being introduced 
into host cells. An excisable donor construct is one which can be excised (freed) from its 
location on the host chromosome or on an extrachromosomal plasmid, by the action of an 
20 inducfole enzyme, for example, a unique restriction enzyme or a re^ . 
excisable, the donor construct must be flanked by recognition sites for the excising enzyme. 
For example, in the upper diagram of Figure 2, the donor construct is flanked by FRT sites 
which render the construct excisable by the Hp recombinase. 



25 



30 



"Recombinogenic donor" is the term used herein to describe the structure of that part 
of the donor construct resulting from the action of the unique endonuclease and, if so designed, 
the recombinase. The recombinogenic donor is not integrated in the host chromosome and is 
characterized by having segments homologous to the target interrupted by a double-strand 
break for ends-in targeting, or having segments homologous to the target flanked by broken 
ends in the case of ends-out targeting. For example, a recombinogenic donor resulting from 
the action of a unique endonuclease acting on a recognition site introduced into a target gene 
modifying sequence could have a structure as diagramed in the lower part of Figure 2, a linear 
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Many of the genetic constructs used herein are described in terms of (he relative 
positions of the various genetic elements to each other. "Adjacent" is used to indicate that two 
genetic elements are next to one another without implying actual fusion of the two sequences. 
For example, two segments of DNA adjacent to one another can be separated by 
5 oligonucleotides providing a restriction she, or having no apparent function. , "Flanking" is 
used to indicate that the same, similar, or related sequences exist on either side of a given 
sequence. For example, in the upper diagram of Figure 2, the y + gene is shown flanked 
by P2t segments. That construct is in turn flanked by FRT sites oriented parallel to one 
another. Segments described as "flanking" are not necessarily directly fused to the segment 
10 they flank, as there can be intervening, non-specified DNA. These and other terms used to 
describe relative position are used according to normal accepted usage in the field of genetics. 

The method of the invention can be used for gene targeting in any organism. 
Minimum requirements include a method to introduce genetic material into the organism (either 

15 stable or transient transformation), existence of a unique endonuclease that can be expressed 
in the host organism (or a modified host organism) without harming the organism, and 
sequence information regarding the target gene or a DNA clone thereof. The efficiency with 
which homologous recombination occurs in the cells of a given host varies from one class of 
organisms to another. However the use of an efficient selection method or a sensitive screening 

20 method can compensate for a low rate of homologous recombination. Therefore the basic tools 
for practicing the invention are available to those of ordinary skill in the art for such a wide 
range and diversity of organisms that the successful application of such tools to any given host 
organism is readily predictable. 

25 Transformation can be carried out by a variety of known techniques , depending on the 

organism, on characteristics of the organism's cells and of its biology. Stable transformation 
involve^DNA entry into cells and into the cell nucleus. For single-celled organisms and 
organisms that can be regenerated from single cells (which includes all plants and some 
mammals), transformation can be carried out by in vitro culture, followed by selection for 

30 transformants and regeneration of the transformants. Methods often used for transferring DNA 
or RNA into cells include micro-injection, particle gun bombardment, forming DNA or RNA 
complexes with cationic lipids, liposomes or other carrier materials, electroporation, and 
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incorporating transforming DNA or RNA into virus vectors. Other techniques are known in 
the art. For a review of the state of the art of transformation, see standard reference works 
such as Methods in Enzymology, Methods in Cell Biology, Molecular Biology Techniques, 
all published by Academic Press, Inc. N.Y. DNA transfer into the cell nucleus occurs by 
5 cellular processes, and can sometimes be aided by choice of an appropriate vector, by including 
integration site sequences which can be acted upon by an intracellular transposase or 
recombinase. For reviews of transposase or recombinase mediated integration see, e.g., Craig, 
N.LK. (1988) Ann. Rev. Genet. 22:77; Cox, M.M. (1988) In Genetic Recombination (R. 
Rucherlapati and G.R. Smith, eds.) 429-443, American Society for Microbiology, Washington, 

10 D.C.; Hoess, R.H. et al. (1990) In Nucleic Acid* and Molecular Biology (F. Eckstein and 
D.M.J. Lilley eds.) Vol. 4, 99-109, Springer-Verlag, Berlin. Direct transformation of 
multicellular organisms can often be accomplished at an embryonic stage of the organism. For 
example, inDrosophila, as well as other insects, DNA can be micro-injected into the embryo 
at a multinucleate stage where it can become integrated into many nuclei, some of which 

15 become the nuclei of germ line cells. By incorporating a marker as a component of the 
transforming DNA, non-chimeric progeny insects of the original transformanf individual can 
be identified and maintained. Direct microinjection of DNA into egg or embryo cells has also 
been employed effectively for transforming many species. In the mouse, the existence of 
pluripotent embryonic stem (ES) cells that are culturable in vitro has been exploited to generate 

20 transformed mice. The ES cells can be transformed in culture, then micro-injected into mouse 
blastocysts, where they integrate into the developing embryo and ultimately generate germline 
chimeras. By interbreeding heterozygous siblings, homozygous animals carrying the desired 
gene can be obtained. Recently stable germline transformations were reported in mosquito 
(Catteruccia F., et al., (2000) Nature 405:954-962). For reviews of the methods for 

25 transforming multicellular organisms, see, e.g. Haren et al. (1999) Anna. Rev. Microbiol. 
53:245-281; Reznikoff et al. (1999) Biochem. Biophys. Res. Common. Dec.29:266(3):729- 
734; Ivies et al. (1999) 60:99-131; Weinberg (1998) Mar.26:8(7):R244-247; Hall et al. (1997) 
FEMS Microbiol Rev. Sep:21(2): 157-178; Craig (1997) Annu. Rev. Biochem. 66:437-474; 
Beall et al. (1997) Genes Dev. Aug.l5:ll(16):2137-2151. Transformed plants are obtained 

30 by a process of transforming whole plants, or by transforming single cells or tissue samples 
in culture and regenerating whole plants from the transformed cells. When germ cells or seeds 
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control, so that the selection can be activated or inactivated under the control of the 
investigator. Positive selection can be provided, for example, by a gene conferring resistance 
to an antibiotic or other toxin so that in the presence of the toxin cells lacking the resistance 
are less viable than cells possessing the resistance. Similarly, negative selection is provided 
by a gene conferring sensitivity to a specific compound, so mat cells possessing the gene are 
selectively killed in die presence of the toxin. The foregoing are merely examples of the great 
variety and complexity of markers used for selection, and of selection systems in general which 
are known in the art, and fundamental to the practice of genetics. Markers for screening are 
those which convey an identifiable trait (phenotype) to cells or organisms possessing the 
marker,- which trait is lacking in cells or organisms Oiat do not possess the marker. An antigen 
not normally present in the organism or in individual cells can serve as a screening marker, 
using a fluorescent-tagged antibody or other tag to identify the antigen's presence. Many 
screening markers are known and available to those skilled in the art. The use of markers is 
exemplified for various aspects of the invention, however it will be understood that the manner 
of using markers and the choice of a particular marker type in a given situation is well- 
understood in the art, and that the invention does not depend on the use of any particular type 
of marker. 

"Recombination," in the context of the present invention, is a term for a process in 
which genetic material at a given locus is modified as a consequence of an interaction with 
other genetic material. "Homologous recombination" is recombination occurring as a 
consequence of interaction between segments of genetic material that are homologous, or 
identical, at least over a substantial length of nucleotide sequence. The minimal necessary 
length is functionally defined and may vary from cell to cell, or organism to organism (i.e., 
between species). Homologous recombination is an enzyme-catalyzed process that occurs in 
essentially all cell types. The reaction takes place when nucleotide strands of homologous 
sequence are aligned in proximity to one another and entails breaking phosphodiester bonds 
in the nucleotide strands and rejoining with neighboring homologous strands or with an 
homologous sequence on the same strand. The breaking (cutting) and rejoining (splicing) can 
occur with precision such that sequence fidelity is retained. Homologous recombination 
between a target gene and a donor construct of identical sequence except for a marker can 
result in reconstitution of the target, distinguishable only by the presence of the marker. 
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Homologous recombination occurs only rarely, if ever, unless the donor and the target can be 
present in physical proximity to one another. In one embodiment of the invention, the donor 
construct is integrated at a chromosomal site that is not near the target. The cells are then 
provided with means for freeing the recombinogenic donor from its chromosomal locus to 
5 allow homologous recombination to take place. In another embodiment, the donor construct 
is present in the cell but not integrated into the chromosome, for example as an autonomously 
replicating plasmid or as a non-replicating, transiently present plasmid. In either of the latter 
cases, the donor construct is already free to approach the target and die action of rendering the 
donor recombinogenic by introducing a double strand DNA break stimulates homologous 

10 recombination with the target. The frequency of homologous recombination is influenced by 
a number of factors. Different organisms vary with respect to the amount of homologous 
recombination that occurs in their cells and the relative proportion of homologous to non- 
homologous recombination that occurs is also species-variable. The length of the donor-target 
region of homology affects the frequency of homologous recombination events, the longer the 

15 region of homology, the greater the frequency. The length of the homology region needed 
to observe homologous recombination is also species-variable. However, differences in the 
frequency of homologous recombination events can be offset by the sensitivity of selection for 
the recombinations that do occur. With sufficiently sensitive selection, e.g., by choosing a 
combination of positive and negative selection, virtually every recombination event can be 

20 identified. Other factors, such as the degree of homology between the donor and the target 
sequences will also influence the frequency of homologous recombination events, as is well- 
understood in the art. It will be appreciated that absolute limits for the length of the donor- 
target homology or for the degree of donor-target homology cannot be. fixed, but depend on 
the number of potential events which can be scored and the sensitivity of selection. Where it 

25 is possible to screen 10 9 events, for example, in cultured cells, a selection that can identify 1 
recombination in 10 9 cells will yield useful results. Where the organism is larger, or has a 
longer generation time, such that only 100 individuals can be scored in a single test, the 
recombination frequency must be higher and selection sensitivity is less critical. All such 
factors are well known in the art, and can be taken into account when adapting the invention 

30 for gene targeting in a given organism. The invention can be most readily carried out in the 
case of organisms which have rapid generation times or for which sensitive selection systems 
are available, or for organisms that are single-celled or for which pluripotent cell lines exist 

20 
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that can be grown in culture and which can be regenerated or incorporated into adult 
organisms. In the former case, the invention is demonstrated for the fruit fly, Drosophila. 
The latter case is demonstrated with a plant, Arabidopsis. These organic are representee 
of their respective classes and the description demonstrates how the invention can be applied 
throughout those classes. It win be understood by those skilled in the art tnat the invention is 
operative independent of the method used to transform the organism. Further, the fact that the 
invention is applied to such disparate organisms as plants and insects demonstrates the 
widespread applicability of the invention to living organisms generally. 



The organisms in which gene targeting can be accomplished according to the invention 
include, but are not limited to: insects, including insect species of the orders Coleoptera, 
Diptera, Hemiptera, Homoptera, Hymenoptera, Lepidoptera and Orthoptera; plants, including 
both monocotyledonous plants (monocots) including, but not limited to, maize, rice, wheat, 
oats and other grain crops, and dicotyledonous plants (dicots) mcluding, but not limited to, 
15 potato, soybean and other legumes, tomato, members of the Brassica family, Arabidopsu, 
tobacco, grape and ornamental species such as roses, carnations, orchids and the like,' 
inammals, including known transformable species such as mouse, rat, sheep, and pig, and 
others, as transformation methods are developed, including bovine and primates including 
humans; birds, including food species such as chicken, turkey, duck and goose; fish, including 
20 species raised for food or sport including trout, salmon, catfish, tilapia, ornamental breeds such 
as koi and goldfish, and the like; and shellfish, including oyster, clam, shrimp and the like. 
Gene targeting in such organisms is useful to accomplish genetic modification to impart 
disease resistance, improve hardiness and vigor, remove genetic defects, improve product 
quality or yield, impart new desirable traits , alter growth rates or in the case of pest species 
25 and disease vectors, introduce, alter or remove genes affecting the ability of the pest or vector 
to spread disease or cause damage. 

It will be understood that the invention is also useful for gene targeting in somatic cells 
and tissues, and is not limited to germ line or phiripotent cells. Targeting in somatic cells 
provides the ability to make desired and specific genetic modification to target host cells and 
tissues. Targeting in somatic cells now provides a means of producing transgenic animals 
through the nuclear transfer technique (McCreath, K. J. et al. (2000) Nature 405:106&-1069; 
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Polejaeva, L A. et al., (2000) Nature 407:86-90). Transformation methods using tissue or 
cell-type-specific vectors can be employed for providing a desired donor construct in the cells 
of choice, or the cells can be transformed by non-specific means, using tissue-specific 
promoters to ensure activation of targeting the cells of choice. Obvious choices include tumor 
5 cells and specific tissues affected by a genetic defect. The methods of the invention are 
therefore useful to expand and supplement the available techniques of gene therapy. 

A factor which influences targeting efficiency is the extent of homology or 
nonhomology between donor and target. There are many reports showing that increased 

10 donontarget homology increases the absolute targeting frequency in mammalian cells, see e.g. , 
M. J. Shulman et al. (1990) Mol Cell Biol 10:466, C. Deng, M.R. Capecchi (1992) Mol 
Cell. Biol 12:3365. JuDrosopfdla, investigators have examined the effect of homology in the 
context of P transposon break-induced gene conversion. The ds break that is left behind when 
a P element transposes is a substrate for gene conversion, and may use ectopically-located 

15 homologous sequences as a template. Dray and Gloor (, J. B. Scheeber, G. M. Adair (1994) 
Mol Cell Biol 14:6663; T Dray, HG. B. Gloor (1997) Genetics 147:684) found that as little 
as 3 kb of total template:target homology sufficed to copy a large non-homology segment of 
DNA into the target with reasonable efficiency. In prior work on FLP-mediated DNA 
mobilization, very different efficiencies were observed for HJP-mediated integration at a target 

20 FRT when comparing experiments in which the donor and target shared different extents of 
homology (M. M, Golic (1997) Nucleic Acid Res. 25:3665). Integration was approximately 
10-fold more efficient when the donor and target shared 4. 1 kb of homology than when they 
shared only 1.1 Kb of homology, suggesting the possibility that interactions between an 
extrachromosomal DNA molecule and a chromosomal sequence may be stabilized to some 

25 degree by shared sequences. If the extent of homology is an important factor, increasing the 
extent of donontarget homology may increase die overall frequency of targeting, and as a 
consequence provide a means to shift the ratio of targeted to non-targeted events. The limited 
data available from Drosophila leads us to conclude that 2-4 kb of donontarget homology is 
sufficient for efficient targeting, although in the experiment of Example 2 the donor and target 

30 shared 8 kb of homology. 
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The gene targeting technique of the invention is efficient enough that chemical or 
genetic selection methods were not needed for the described embodiment but these can be 
implemented as part of the scheme if desired. Furthermore, me procedure in general does not 
require special lines of cultured cells, as does mouse gene targeting. Because the technique 
can be carried out in the intact organism it can be used for gene targeting in many other species 
of animals and plants, with the only requirement being that a method of transformation exist. 

It will be understood that for each of the specific features of the process of the 
invention as just described there exists a panoply of functional equivalents which can be 
employed, as desired and as appropriate, to carry out the invention. 



Use of other site-specific recombinases and/or site-specific endonucleases. 

There are a large number of site-specific endonucleases known that function similarly 
to FLP, and that can be substituted in this procedure. For example the Cre recombinase and 
15 its lox target site can be employed instead of the FLP-FRT system. Many other site-specific 
endonucleases are listed by Nunes-Duby et al (1998) Nucleic Acids Research 26:391^106, and 
there are no doubt many yet to be found. 

The I-Scel intron-homing endonuclease is also one of a large number of functionally 
20 similar rare-cutting endonucleases. Many of these, for instance I-Tlil, I-Ceul, I-Crel, I-Ppol 
and PI-PspI, can be substituted for I-Scel in the targeting scheme. Many are listed by Belfort 
and Roberts (1997) Nucleic Acids Research 25:3379-3388). Many of these endonucleases 
derive from organelle genomes in which the codon usage differs from the standard nuclear 
codon usage. To use such genes for nuclear expression of their endonucleases it may be 
25 necessary to alter the coding sequence to match that of nuclear genes. This can be done by 
synthesizing the gene as a series of oligonucleotides, that are then ligated together in the proper 
order to produce a segment of DNA that encodes the entire endonuclease with nuclear codon 
usage. 

30 Introduction of mutations. 

The gene targeting technique described herein can be used to substitute one allele for 
another at the targeted locus. This provides a way to insert large or small mutations into a 
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targeted locus, or to convert a mutant allele into the wild-type allele. In cases where the 
mutant phenotype of the targeted gene is unknown, molecular techniques, such as PCR, can 
be used to detect the mutated allele. A two-step method that provides a simple genetic method 
to detect allelic, substitutions can also be used (Figure 9). 

To make a donor construct, a cloned copy (or partial copy) of the target gene is 
engineered to carry the desired mutation and an I-^Scel cut site. In this example a simple point 
mutation is introduced, for instance a change of a coding codon to a stop codon. This 
technique is not limited to point mutations; insertions or deletions of varying sizes can be 
introduced also. The introduced mutation may be placed to the left or right of die I-Scel 
recognition site; in Fig. 9 his shown to the right for illustrative purposes only. Thedonor 
version of the target is placed into a transposon vector between FRTs, along with a marker 
gene (such as the white* eye color gene), and a cut site for a second site-specific endonuclease 
(such as I-Crel), and transformed into Drosophila. The engineered mutation is then 
recombined into the target gene as a Class H (Fig. 6) targeting event by simply screening for 
altered chromosomal linkage of the marker gene. The product is a tandem duplication with 
a point mutation in one copy, and the marker gene and I-Crel cut she between the tandem 
copies of the target gene. Molecular analysis is used to confirm Ihe presence of the introduced 
mutation. 

In the second step, I-Crel endonuclease is introduced into the flies produced in step 1 
(using a transgene or any of several other methods discussed here). This endonuclease cuts 
die chromosomes in the region between the tandem repeats, causing frequent reduction of the 
two tandem copies to a single copy by recombination (as shown by the data of Figure 1). Loss 
of the tandem repeat is easily recognized because die w + marker gene is lost in die process. 
In a fraction of the cases, the crossover that eliminates die tandem duplication will occur to 
the right of the point mutation, and the resultant allele carries the introduced mutation. 
Molecular or genetic analysis can be used to determine which of the marker-loss alleles carry 
the mutation, using methods and markers known to those skilled in the art. 

The foregoing two step method requires no knowledge of the mutant phenotype. It is 
based simply on the segregation and then loss of a marker gene. A variation of the foregoing 
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procedure is to introduce two point iniitationsmlofliedomjra^of^g^: one<meadisi^ 
of the I-Scel cut site. In this case, the two alleles of the target gene in toe tandem duplication 
would each be mutated. Molecular analysis is used to confirm the presence of both point 
mutations. Step 2, as described, is not be necessary in order to generate a mutant organism. 
Moreover, because a marker gene is present between the mutant alleles, it is very easy to 
follow the segregation of the mutant locus through crosses. 

This procedure can also provide a way to select for the survival of the mutant 
organisms. For instance, if the marker gene was a chemical resistance gene, then treatment 
of the organisms with the chemical selects for those carrying the tandem dupUcation, and the 
engineered alleles. 

If desired, step 2 can be implemented to reduce the two mutant alleles to a single 
mutant allele. Only crossovers mat occurred between the two mutations would restore the 
wild-type; all others produce an allele carrying one or the other mutation. 

A two-step process can be employed for generating a null mutation of a target gene. 
Two homologous recombmations are targeted for flanking homologous segments on either side 
of the target gene resulting in a deletion of the target gene, as diagramed in Fig. 16. The 
donor construct includes a first flanking homologous segment carrying a unique endonuclease 
site, such as I-Scel, a second flanking homologous segment, a recombinase gene, such as I- 
Crel and a recombinase recognition site, such as tax. In the target genome, the target gene lies 
between the two flanking homologous segments. A double strand break induced in the donor 
by I-Scel endonuclease stimulates homologous recombination in the first flanking homologous 
segment which integrates the donor construct into the genome as shown in the first step of Fig. 
16. Induction of I-Crel results in a cleavage at its recognition site to allow pairing and 
recombination within the second flanking homologous segment, as shown in the second step 
of Fig. 16. The effect of the second recombination event is deletion of the target gene and 
retention of the flanking homologous segments, as shown in the bottom line of Fig. 16. 
Appropriate selection markers can be incorporated to identify stages of the process. Deletion 
of the target can, itself, serve as a selectable event, depending on the null phenotype. Other 
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techniques of deletion targeting or replacement targeting can be employed, as known in die art, 
for example, by employing an ends-out targeting construct. 

Targeting by use of a site-specific endonuclease only. 

5 Donor constructs can also be engineered to contain two unique endonuclease cut sites 

such as I-Scel sites that flank a cloned donor version of the target locus and a marker gene. 
The cloned donor could be engineered in two halves so that the right half of the donor version 
of the target gene is located at the left end of the construct and vice-versa, with the marker 
gene between the halves. After introducing such a construct into the organism, double cutting 

10 at the flaking sites releases a*donor molecule that is essentially identical to the released donor 
molecule shown in the lower half of Figure 2. 

Ends-out targeting. 

Ends-out targeting can also be applied using a site-specific recombinase and unique 
1 5 endonuclease to release die donor molecule, or using only a unique site-specific endonuclease, 
but including two sites for site-specific endonuclease cutting within the donor construct. A 
donor construct intended for ends-out targeting is prepared by providing that the coding 
sequences of segment lying on either side of the inserted endonuclease site are in antiparallel 
orientation with respect to one another. Where the normal coding sequence of the target is 
20 abcdefgh, insertion of an endonuclease site between d and e provides abcd/efgh, where the two 
parts separated by die cleavage site are in parallel orientation. Cleavage yields dcba — hgfe 
which can recombine by "ends-in" recombination. For ends-out targeting the antiparallel 
orientation is constructed, dcba/hgfe, which upon cleavage yields abed — efgh. See Fig. 3. 

25 Other ends-out targeting schemes are within the scope of the invention. Such schemes 

can involve the incorporation of a negatively selectable marker at a site which can be used to 
favor targeted over non-targeted insertions or at a site which can be used to eliminate progeny 
with the donor chromosome. 

30 Use in other insects. 

The method of the invention can be applied to other insects also. For a review of 
genetic manipulations in insects see Insect Transgenesis Methods and Applications, Handler, 
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A. M. , and A. A. James eds. (2000) CRC Press, Boca Raton, Florida, which is incorporated 
by reference in its entirety. One potential problem in other insects is a paucity of genetic 
markers that can be followed to do the segregation screening. Thispancity of markers applies 
to many other organisms in which the invention can be used for gene targeting. The problem 
can be dealt with by placing two dominant markers in the donor transgene. One of the markers 
(for instance a green fluorescent protein [GFP] gene) would be placed outside the FRTs. The 
second marker (for instance a chemical resistance gene) would be placed between the FRTs 
along with the target locus. After treeing the donor construct the first marker will stay in 
place, while the second marker will accompany the donor targeting DNA to the targeted locus . 
Therefore, after induction of FLP and I-Scel enzymes, screening can be carried out by looking 
for animals that are resistant to the chemical, but which do not show GFP fluorescence. These 
would be individuals in which the resistance gene had segregated from the GFP donor 
chromosome marker gene. Targeting can be verified by molecular means. A positive-negative 
selection method can also be employed in such a screen to increase the sensitivity of 
IS recombinant detection. 

Use in other animals. 

This method can also be applied in other animals, including, but not limited to, mice, 
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humans, cattle, sheep, pigs, nematodes, amphibians, and fish. 
Use in plants. 

Targeted alteration of plant genomes can be carried out using the procedures described 

herein. 



It is contemplated that the gene targeting methods of the invention can be used in a 
variety of plants such as grasses, legumes, starchy staples, Brassica family members, herbs 
and spices, oil crops, ornamentals, woods and fibers, fruits, medicinal plants, and alternative 
and other crops. Preferably the invention can be used in plants such as sugar cane, wheat, rice, 
maize, potato, sugar beet, cassava, barley, soybean, sweet potato, oil palm fruit, tomato, 
sorghum, orange, grape, banana, apple, cabbage, watermelon, coconut, onion, cottonseed, 
rapeseed, and yam. 
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Grasses include, but arc not limited to, wheat, maize, rice, rye, triticale, oats, barley, 
sorghum, millets, sugar cane, lawn grasses, and forage grasses. Forage grasses include, but 
are not limited to, Kentucky bluegrass, timothy grass, fescues, big bluestem, little bluestem 
and blue gamma. 

Legumes include, but are not limited to, beans like soybean, broad or Windsor bean, 
kidney bean, lima bean, pinto bean, navy bean, wax bean, green bean, butter bean, and mung 
bean; peas like green pea, split pea, blade-eyed pea, chick-pea, lentils, and snow pea; peanuts; 
other legumes like carob, fenugreek, kudzu, indigo, licorice, mesquite, copaifera, rosewood, 
rosary pea, senna pods, tamarind, and tuba-root; and forage crops like alfelfa. 

Starchy staples include, but are not limited to, potatoes of any species including white 
potato, sweet potato, cassava, and yams. 

Brassica, include, but are not limited to, cabbage, broccoli, cauliflower, brussels 
sprouts, turnips, and radishes. 

Alternative and other crops include, but are not limited to, quinoa, amaranth, tarwi, 
tamarillo, oca, coffee, tea, and cacao. 

Herbs and spices include, but are not limited to, cinnamon, black and white pepper, 
cloves, nutmeg and mace, ginger and turmeric, saffron, hot chilies and other capsicum 
peppers, vanilla, allspice, mint, parsley family herbs (e.g., parsley, dill, caraway, fennel, 
celery, anise, coriander, cilantro, cumin, chervil) mustard family members (e.g., mustard and 
horseradish), and lily family members (e.g., onion, garlic, leeks, shallots, and chives). 

Oil crops include, but are not limited to, soybean, palm, rapeseed, sunflower, peanut, 
cottonseed, coconut, olive palm kernel. 



Woods and fibers include, but are not limited to, cotton, flax, and bamboo: ~ 
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Both site-specific recombinases [Dale and Ow, (1991) PNAS 88:10558-10562L Lyznik 
et al., (1996) Nucleic Acids Res. 24(19)3784-3789]; and site-specific unique endonucleases 
[Puchtaetal. (1996) PNAS 93:5055-5060] have been shown to function in plants. The two can 
be used combinatorially to bring about gene targeting in plants. 

Lloyd and Davis (1994) Mot Gen. Genetics 242:653-657 demonstrated that the 
cauliflower mosaic virus (CMV) 35S promoter and terminator can be used to direct expression 
of FLP in tobacco plants. Puchta et al. demonstrated the same method for expression of the 
JoScel endonuclease in tobacco, m other examples, recombinases have also been expressed in 
plants using heat-shock promoters [Kilby et al., (1995) The Plant J. 8:637-652; Sieburth et al., 
(1998) Development 125:4303^312]. Transformation of plants was accomplished by use of 
Agrobacterium T-DNA in those cases. Similar methodology can be used in other plants, or 
transformation of tissues of cultured cells may be accomplished by Holistic DNA-coated 
particle bombardment 

Functional recombinase and/or endonuclease activity may be achieved by transgene 
expression, by introduction of appropriate synthetic mRNAs, or introduction of the protein 
themselves. 



Essentially the entire panoply of unique endonucleases, recombinases and marker genes 
can be expressed in plants as constitutive, developmental stage-specific, or inducible 
transgenes. A variety of known inducible promoters that function in plants are available to 
those skilled in the art, including heat shock promoters. Development stage-specific promoters 
are useful, for example where it is advantageous to carry out targeting in specific cell types or 
at specific times of development; for example, during embryo development, within the cells 
of shoot apical meristem, or in mother cells that undergo meisosis.. A number of such 
promoters are known; e.g., the NZZ promoter [Schiefthaler, et al. (1999) Proc. Natl. Acad. 
Sci. USA 96:11664-11669]; SPL [Yang et al (1999) Genes and Development 13:2108-2117] 
DEF1 rjBhatt et al (1999) Plant J. 19:463-*72]; SYN1 [Bai et al (1999) Plant Cell 11:417-430]. 
0 ASK1 [Yang et al. (1999) Proc. Natl. Acad. Sci. USA 96:11416-11421]; AtDMCl [Klimyuk 
and Jones (1997) Plant J. 11 : 1-14] . 
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Techniques and agents for introducing and selecting for the presence of heterologous 
DNA in plant cells and/or tissue are well-known. Selection can be positive or negative. 
Genetic markers allowing for the selection of heterologous DNA in plant cells are well-known, 
e.g., genes carrying resistance to an antibiotic such as kanamycin, hygromycin, gentamycin, 
5 or bleomycin. The marker allows for selection of successfully transformed plant cells growing 
in the medium containing the appropriate antibiotic because they will carry the corresponding 
resistance gene . In most cases the heterologous DNA which is inserted into plant cells contains 
a gene which encodes a selectable marker such as an antibiotic resistance marker, but this is 
not mandatory. An exemplary drug resistance marker is the gene whose expression results 

10 in kanamycin resistance, i.e. , the chimeric gene containing nopaline synthetase promoter, Tn5 
neomycin phosphotransferase II and nopaline synthetase 3' non-translated region described by 
Rogers et al., Methods for Plant Molecular Biology, A. Weissbach and H. Weissbach, eds., 
Academic Press, Inc., San Diego, CA (1988). Negative selectable markers which can be used 
in the invention include, but are not limited to, codA [Stougaard (1993) Plant Journal 3:755- 

15 761] tms2 [Depicker et al., (1988) Plant Cell Rep. 7:63-66] nitrate reductase [Nussame et al 
., (1991) Plant Journal V261-ri4\ and SU1 [O'keef et al. (1994) Plant Physiol 105:473-482]. 

Techniques for genetically engineering plant cells and/or tissue with an expression 
20 cassette comprising an inducible promoter or chimeric promoter fused to a heterologous coding 
sequence and a transcription termination sequence are to be introduced into the plant cell or 
tissue by Agrobacterium-m&lizted transformation, electroporation, microinjection, particle 
bombardment or other techniques known to the art The expression cassette advantageously 
further contains a marker allowing selection of the heterologous DN A in the plant cell, e.g. , 
25 a gene carrying resistance to an antibiotic such as kanamycin, hygromycin, gentamycin, or 
bleomycin. Assays for phenolic acid esterase and/or xylanase enzyme production are taught 
herein or in U.S. Patent No. 5,824,533, for example, and other assays are available to the art. 

A DNA construct carrying a plant-expressible gene or other DNA of interest can be 
30 inserted into the genome of a plant by any suitable method. Such methods may involve, for 
example, the use of liposomes, electroporation, diffusion, particle bombardment, 
microinjection, gene gun, chemicals that increase free DNA uptake, e.g., calcium phosphate 
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copretipitation, viral vectors, and other techniques practiced in the art. Suitable plant 
transformation vectors include those derived from a Ti plasmid of Agrobacterium tumefatiens, 
such as those disclosed by Herrera-Estrella (1983), Bevan (1983), Klee (1985) and EPO 
publication 120,516 (Schilperoort et al.). In addition to plant transformation vectors derived 
5 from the Ti or root-inducing (Ri) plasmids of Agrobacterium, alternative methods can be used 
to insert the DNA constructs of this invention into plant cells. 

The choice of vector in which the DNA of interest is operatively linked depends 
directly, as is well known in me art, on the functional properties desired, e.g., replication, 
10 protein expression, and the host cell to be transformed, these being limitations inherent in the 
art of constructing recombinant DNA molecules. The vector desirably includes a prokaryotic 
replicon, i.e., a DNA sequence having the ability to direct autonomous rephcation and 
maintenance of the recombinant DNA molecule extra-chromosomaUy when introduced into a 
prokaryotic host cell, such as a bacterial host cell. Such replicons are well known in the art. 
15 In addition, preferred embodiments that include a prokaryotic replicon also include a gene 
whose expression confers a selective advantage, such as a drug resistance, to the bacterial host 
cell when introduced into those transformed cells. Typical bacterial drug resistance genes are 
those that confer resistance to ampicillin or tetracycline, among other selective agents. The 
neomycin phosphotransferase gene has the advantage that it is expressed in eukaryotic as well 
as prokaryotic cells. 

Typical expression vectors capable of expressing a recombinant nucleic acid sequence 
in plant cells and capable of directing stable integration within the host plant cell include 
vectors derived from the tumor-inducing (Ti) plasmid of Agrobacterium tumefatiens described 
by Rogers et al. (1987) Meth. in Enzymol. 153:253-277, and several other expression vector 
systems known to function in plants. See for example, Verma et al., No. WO87/00551; 
Cocking and Davey (1987) Science 236:1259-1262. 

A transgenic plant can be produced by any means known to the art, including but not 
limited to Agrobacterium tumefaciens-me^ated. DNA transfer, preferably with a disarmed T- 
DNA vector, electroporation, direct DNA transfer, and particle bombardment [see Davey et 
al. (1989) Plant Mol. Biol. 13:275; Walden and Schell (1990) Eur. J. Biochem. 192:563; 
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Joersbo and Burnstedt (1991) Physiol Plant. 81:256; Potrykus (1991) Annu. Rev. Plant 
Physiol. Plant Mol. Biol 42:205; Gasser and Raley (1989) Science 244:1293; Leemans (1993) 
Bio/Technology. 11:522; Beck et al. (1993) Bio/Technology. 11:1524; Koziel et al. (1993) 
Bio/Technology. 11:194; and Vasil et al. (1993) Bio/Technology. 11:1533). Techniques are 
well-known to the art for the introduction of DNA into monocots as well as dicots, as are the 
techniques for culturing such plant tissues and regenerating those tissues. 

Many of the procedures useful for practicing the present invention, whether or not 
described herein in detail, are well known to those skilled in the art of plant molecular biology. 
Standard techniques for cloning, DNA isolation, amplification and purification, for enzymatic 
reactions involving DNA ligase, DNA polymerase, restriction endonncleases and the like, and 
various separation techniques are those known and commonly employed by those skilled in die 
art. A number of standard techniques are described in Sambrook et al. (1989) Molecular 
Cloning, Second Edition, Cold Spring Harbor Laboratory, Plainview, New York; Maniatis et 
al. (1982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New York; Wu 
(ed.) (1993) Meth Enzymol 218, Part I; Wu (ed.) (1979) Meth Enzymol. 68; Wu et al. (eds.) 
(1983) Meth. Enzymol. 100 and 101; Grossman and Moldave (eds.) Meth. Enzymol. 65; Miller 
(ed.) (1972) Experiments in Molecular Genetics, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York; Old and Primrose (1981) Principles of Gene Manipulation, University of 
California Press, Berkeley; Schleif and Wensink (1982) Practical Methods in Molecular 
Biology, Glover (ed.) (1985) DNA Cloning Vol. I and H, BRL Press, Oxford, UK; Haines and 
Higgins (eds.) (1985) Nucleic Acid Hybridization, IRL Press, Oxford, UK; and Setlow and 
Hollaender (1979) Genetic Engineering: Principles and Methods, Vols. 1-4, Plenum Press, 
New York, Kaufman (1987) in Genetic Engineering Principles and Methods, J.K. Setlow, ed., 
Plenum Press, NY, pp. 155-198; Fitchen et al. (1993) Annu. Rev. Microbiol. 47:739-764; 
Tolstoshev et al. (1993) in Genomic Research in Molecular Medicine and Virology, Academic 
Press. Abbreviations and nomenclature, where employed, are deemed standard in the field 
and commonly used in professional journals such as those cited herein. 

By crossing, a plant that carries a site-specific recombinase and a unique site-specific 
endonuclease transgenes, under control of the same promoter, can be constructed. 
Alternatively, both transgenes could be placed within the same T-DNA (or other) 
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transformation construct, and transf onnants selected by expression of a linked resistance gene, 
such as hygromycin resistance, techniques which are well-known in the art. Although the 
representative embodiment described below refers to transformation by using T-DNA it will 
be understood that other transformation methods are available to those skilled in the art, for 
5 those plant species, notably monocots, that are less amenable to T-DNA transformation. 

A donor construct can be constructed as diagramed in Figure 10. The construct carries 
a chemical resistance gene between recombinase target sites, for instance a kanamycin 
resistance gene as used by Lloyd and Davis. A cloned copy of the target gene with a site- 
10 specific unique endonuclease cut site within it is also placed between the recombinase target 
sites. The donor construct carries a second marker gene, for instance GFP (green fluorescent 
protein) or GUS (beta-glucuronidase), outside of the recombinase target sites. Alternatively, 
the second marker gene can be a negatively-sdectable marker gene such as codA, tms2, nitrate 
reductase, or SU1. 

By crossing, a plant is generated that expresses the site-specific recombinase and site- 
specific endonuclease that carries the donor construct. Expression of the enzymes will cause 
excision and cutting of the donor molecule, which can then integrate at the target locus by 
homologous recombination. Recombination events can be found by screening for offepring 
that are kanamycin-resistant and are GFF, GUS", or NSM~ (negative selectable marker 
minus) . In these offspring, that portion of the donor that is flanked by recombinase target sites 
has segregated away from the chromosome that originally carried that donor construct. Some 
fraction of these will be targeted recombinants, and they can be found by a molecular or 
genetic screen. Alternatively, it is contemplated that the donor construct, die site-specific 
recombinase, and site-specific endonuclease are all within the same T-DNA, obviating the need 
for crosses. 

Because transforming DNA may undergo rearrangement in plants, it may be necessary 
to test several independently integrated donor constructs to find one that is suitable for use in 
this scheme. The main concern is that the donor T-DNA may be rearranged in such a way that 
the site-specific recombinase target sites flank the GFP marker, allowing for GFP loss from 
the chromosome that originally carried the donor construct. That occurrence would negate the 
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^forsegreganouofkan-RandGFr. ^ K ^^ a ^ caaUcSniiatti 
from nse by molecular ctarattterizannn and by M, «he integrated conarcc, with the 
recombimse alone. Witt, , donor insertion, the action of reccnnbinase causes to* of 

kan-RbotnotGFP. 

Use in cultured tissues, cells, nuclei, or gametes. 

The method of the invention can also be applied in culuned cells or ussues, including 

twh as egga or sperm in varying stagea of their development 



U was demonstrated ma. an emachrcmosotnal DNA molecnlc with cut or broken ends 
that is generated in vivo, through the action of a site-specific recrnnbina* (such aa PLP) sad 
stte-speciSc endonuclease (such aal-SceD.is Kcombinogenfc and can be employed for gene 
targeting. Alternadves for d* reprasentauve embedum^ desert above ana numerous and 
not Innited to the enaymes and constructs used to explam how the invention works. 

Transposases can be used to generate the double-strand (ds) break, substituting for the 
untoue endonucleaae. or to carry can the excision rercuon, substtuting for the recombinase 
Man, tmnsposons, such as P elements in D^opkik,, leave behind a da beak in DNA when 
20 they transpose. TMs pmpeny can he used to generate bn.ken.nded extractaomosomal 
naoleculea for targeting. m w<w ^ ^ ^ ^ 

Tnese examples can be carried out using stabl, integrated teansgene constructs aa fte source 
of the donor molecute (for instence, b, placing die P element construct of Bxanpfc I into a 
Manner Uaospc^on and generating sfcbly n^o^d t^opUla), or transient tnusgenea (for 
25 tnsumce, to T-DNA example of Method 4 below). 1^ expression „ „. „ y 
eapress.on of endogenous transposons or variams thereof, by regulated or eoosnnative 
expression from engmeeted gene construe* that express tmnspcaae. by use of mRNA tiiat 
encodes traosposase, or by using ore purified tranaposase protein. In plants, it may be 
^^^nss the tranaposase and/or re^^ 

tntbemegaspote and microspore motecells, just before or during meiosis. IhefteedDNA 
fragments can be designed for ends-in btrgeting (as shown in the Hgmea) or ends-ou, targeting 
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Genetic screening, selective methods, or molecular methods, can be used to recover the 
targeted recombinants. 



Method 1: Using two copies of a transposon (Figure 11). 

5 A transgenic construct can be produced that carries two copies of a transposon (in tins 

case, the P element of Drosopfuld) that flank the donor DNA. Recombinogenic donor DNA 
refers to the piece of DNA that is freed from the targeting construct as a broken-ended DNA 
molecule, and that is designed to cause homology-directed changes in a specific chromosomal 
locus. The transposition of the two transposons simultaneously, will leave behind two ds 

10 breaks that flank the intervening DNA, freeing that fragment of DNA to recombine with the 
chromosome at the target site. 

Method 2: Using a site-specific recombinase and a transposase (Figure 12). 
In this variation, a site-specific recombinase, such as FLP or Cre (or others known in 
the art), is used to free a segment of DNA that is flanked by recombinase recognition sites 

15 (such as FRTs or lox sites) from the donor construct. This freed DNA is circular in form. It 
will be converted to a linear form by transposition of a transposon from the circle, leaving 
behind a ds break. The procedure can be simplified by using a transient or stable circular 
plasmid as the donor construct. Transposition of the transposon will leave a ds break behind 
in the plasmid. The plasmid is then recombinogenic and can be used for targeting, but with 

20 the disadvantage that vector sequences will be included in the donor DNA. However, these 
can be removed through the use of site-specific recombination or homologous recombination 
induced by a site-specific endonuclease. 

Method 3: Use of transposons to free DNA from the chromosome, and a site-specific 
25 endonuclease to free a donor from the transposon (Figure 13). 

A transposase can be used as an alternative to a recombinase to excise the donor 
construct from the donor site. For ends-in targeting, the donor gene construct can be split as 
shown in Figure 13 and placed within the transposon. Using a transposase for excision, the 
transposase and I-Scel (or other unique endonuclease) can be expressed at approximately the 
30 same time. The fundamental concept relies on the excising of the transposon at the inverted 
repeats by the transposase, followed by cutting at the I-Scel sites with I-Scel. The combined 
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action of the two enzymes creates a recombinogenic donor and is similar to what can be 
accomplished with a site-specific recombinase and site-specific endonuclease. 

Method 4: Use of T-DNA. 

A method similar to that described in method 3 can be employed with T-DNA. The 
construct for this method is analogous to that of method 3 , except for the substitution of the 
respective T-DNA borders for the inverted repeats. This method relies on I-Scel (or other 
unique endonucleases) being expressed in the transformed cells (for example, the egg cell in 
Arabidopsis). The idea is that in cells undergoing transformation, the T-DNA is cut by I-Scel, 
creating a recombinogenic .donor as shown in Figure 13 . 

Further explanation of the invention will be described by examination of various 
embodiments of me invention and reviewing various alternative means by which the invention 
can be carried out. 

Example 1 : 

The first-described embodiment of the invention was carried out in Drosophila using 
broken-ended extrachromosomal DNA molecules to produce homology-directed changes in a 
target locus. Two transgenic enzymes were used for this purpose: the FLP site-specific 
recombinase and the I-Scel site-specific endonuclease. FLP recombinase efficiently catalyzes 
recombination between copies of the FLP recombination Target (FRT) mat have been placed 
in the genome [Golic and Lindquist (1989) CeU 59:499]. When FRTs are in the same relative 
orientation within a chromosome FLP excises the intervening DNA donor construct from the 
chromosome in the form of a closed circle. If the FRTs are close to one another this excision 
is nearly 100% efficient. In accord with the principles of the invention, the excised DNA 
donor construct molecules become recombinogenic if they carry ads break. To generate this 
break we provided for a host organism in which the I-Scel mtron-homing endonuclease from 
yeast was introduced into Drosophila. LScel recognizes and cuts a specific 18 bp recognition 
site sequence [Colleaux, L. et al. (1986) Cell 44:521; Colleaux, L. et al. (1988) Proc. Natl. 
Acad. Set USA 85:6022] which is not normally present in the Drosophila genome. 
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Inducible ds breakage. 

To express I-Scel in flies we constructed a heat-inducible I-Scel gene (701-SceI) and 
used standard P element transformation to generate fly lines carrying the transgene. We used 
two chromosomaUy-integrated tester constructs to assay die efficacy of 701-SceI. Each carried 
a white* reporter gene with an I-Scel cut site adjacent to it as described herein. One of 
the tester constructs also carried a partial duplication of the white reporter gene (Figure 1). 
To test for cutting at I-Scel recognition sites, flies that carried 701-SceI and a reporter 
construct were generated by crossing, and heat-shocked early in their development If I-Scel 
endonuclease cuts the chromosome at the site adjacent to the w + reporter, occasional deletions 
of all or part of the w + gene will occur, and in a white-mill background can be identified by 
the phenotype of eye color mosaicism. The adults that closed exhibited frequent mosaicism 
indicating loss of w sequences. The results demonstrated that the heat-induced I-Scel can cut 
a recognition site introduced into the Drosophila genome. 

15 We also carried out quantitative germline assays of I-Scel cutting efficiency by scoring 

loss of w + in the germline as described herein. The reporter with a cut site adjacent to w + 
exhibited a low frequency of w + loss, but the construct that was flanked by a tandem 
duplication of a portion of w showed nearly 90% loss of w> + , demonstrating that cutting can be 
quite efficient. The 60-fold increase in the frequency of w + loss with the second tester 

20 construct probably does not reflect a real difference in cutting efficiencies, but rather a 
difference in the preferred route of repair. In the second construct, repair with loss of w + 
could occur efficiently either via a single strand annealing mechanism [Rudin and Haber (1988) 
Mol CeU. Biol 8:3918; Maryon and Carroll (1991) Mol CeU Biol 11:3268; Sun, H. et al. 
(1991) Cell 64:1155] or by homologous recombination between the repeats that flank the cut 

25 site. These results indicate that an efficient homologous recombination mechanism exists in 
germline cells and that the double-strand break can provoke that mechanism. 

The coding region of I-Scel was excised from pCMV/SCElXNLS (a gift from M. 
Jasin, Sloan-Kettering Institute; 15) as a 900 bp EcoRI-Sall fragment. The EcoRI overhang 
30 was blunted by Klenow treatment. This fragment was cloned between the blunted BamHI and 
the Sail sites of p70ATG-> Bam [Petersen and Lindquist (1989) CellRegulat. 1:135]. The 
resulting plasmid has the I-Scel gene inserted between the Drosophila hsp70 promoter and its 

37 



WO 01/66717 PCT/US01/07051 

3TJTR. This 701-SceI transgene was cloned as a 2.6 Kb Sall-NotI fragment into the P element 
vector pYCl,8 [Friden and Searles (1991) Nucleic Acid Res. 19:5082], This gave rise to 
pP(y + 70I-SceI]. The 18 bp I-Scel cut site (teemed I-site here) [Colleaux et al. (1988) supra] 
was synthesized as two oligonucleotides, ggccgctagggataacagggtaatgtac (SEQ ID NO:l) and 
5 attaccctgttatccctagc (SEQ ID NO:2) that were allowed to anneal to each other and cloned 
between Not! and Kpnl of plasmid pw8 [Claimants, R. et al. (1987) Nucleic Acids Res. 
15:3947]. This generated pP[w8,I-site], the tester construct of Figure 1A. The same synthetic 
I-site was cloned between the Notl and Kpnl sites of pP[X97] [Golic, M.M. et al. (1997) 
Nucleic Acid Res. 25:3665] to generate pP[X97, I-site]. Each of these constructs was 
10 transformed by standard P element-mediated techniques. The FRT-flanked portion of P[X97, 
I-site] was mobilized to the RS3r4A element on chromosome 2, and to the RS3r-2 element on 
chromosome 3 by FLP-mediated DNA mobilization (20), generating the tester construct of 
Figure IB in two different locations (Golic M. M., et al., (1997) Nucleic Acid Res. 25:3665). 

15 To test I-Scel c utt i ng , males that carried a transformed copy of 701-SceI and one of the 

reporter constructs, with either the reporter-bearing chromosome or its homolog carrying a 
dominant genetic marker, were heat-shocked for 1 hr at 38°C, at 0-3 days of development. 
The heat-shocked males that closed were test-crossed individually, and their progeny scored 
for the eye color. The frequency of w + loss is measured as the fraction of progeny receiving 

20 the reporter chromosome that were white-eyed. For the reporter P[w8, I-site], the results of 
Figure 1 A are the summed results of testing five independent insertions of the reporter that 
were located on either X, 2, or 3. For the reporter of Figure IB, two independent insertions 
weretested. 

25 Example 2: 

We designed a transgenic targeting construct (the donor construct) that had an I-Scel 
cut site placed within a cloned copy of the Drosopftila yellow* (y + ) body color gene. This gene 
was also flanked by FRTs (Figure 2) and the entire assembly inserted with in a P element for 
transformation. In flies that carry this construct the induction of FLP recombinase and I-Scel 
30 endonuclease results in excision of the FRT-flanked DNA to free the donor and cutting of the 
excised circle to generate a recombinogenic donor. 



38 



WO 01/66717 PCT/DSOl/07051 

Two fonns of constructs are typically used in gene targeting - "ends-in" constructs or 
"encW" constructs (Figure 3). Gene targeting in mouse ES cells typically uses ends-out 
constructs [Mansour, S.L. et al. (1988) Nature 336:348], but the donor element that we built 
was designed for ends-in targeting. Ends-in targeting can be generally more efficient than 
5 ends-out targeting in both yeast and mammalian cells [Hasty, P. et al. (1991) Mol Cell Biol. 
11:4509; Hastings, PJ. et al. (1993) Genetics 135:973; Hasty, P. et al. (1994) Mol Cell. Biol 
14:8385; Leung, W.-Y et al. (1997) Proc. Natl Acad. Sci. USA 94:6851]. An ends-in donor 
construct was chosen to increase the frequency of recovering the desired targeted 
recombinants. The donor construct shown in Figure 2 was designed to targetfhey gene which 
10 is located at cytological locus IB, near the tip of the X chromosome. The expected fate of an 
ends-in recombinogenic donor molecule was integration at the locus of homology, producing 
a tandem duplication of the targeted gene as indicated in Figure 3 [Rothstein, R. (1991) 
Methods in Enzymol 194:281]. The targeted locus was the f mutant allele which has a point 
mutation in the first codon [Geyer, P.K. et al. (1990) EMBO /. 9:2247]. Because the I-Scel 
15 cut site in the donor is located to the right of this mutation the result of homologous 
recombination will be that the right-hand copy of y in such a tandem duplication is y + and the 
recessive y mutant phenotype will be masked. The result of gene targeting using the described 
constructs is therefore rescue (recovery of wild-type phenotype) of they mutation. 



20 



We screened for targeted rescue of/ in carrier host flies that carried a heat-inducible 
FLP gene (70FLP), 701-SceI, and the donor construct of Figure 2 (Example 2). We heat 
shocked those flies early in their development, and then test-crossed and screened for progeny 
that were y + but did not carry the chromosome on which the donor construct was originally 
located (Figure 4). Fifty-six independent y + rescue events were recovered and 55/56 mapped 
i to the X chromosome the locus of the f target (Table 1). Molecular analysis using PGR 
revealed that in the majority of cases 02t sequences were still present in close proximity to y 
sequences. Therefore the p2t sequence served as a molecular marker for cytological 
determination of the site of y + integration. (The P2t and p3t genes shown in Figure 2 are part 
of a selection scheme that was not implemented in these crosses.) The P2t gene was used as 
a probe for in situ hybridization to polytene chromosomes. Five independently recovered y + 
lines were examined: in all five, p2t sequences 
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Table 1 

Independent yellow Rescue Events 



PI ass 


Targeted 


Non-targeted 


I 


19 


0 


n 


19 


0 


m 


13 


0 


IV 


4 


1 


Total 


55 


1 



10 

were found at cytological locus IB in addition to the normal location of the p2t gene at 85D 
on the right arm of chromosome 3 (Figure 5), com%ming mat targeted integration of the donor 
15 construct had occurred in the y region. 

The y rescue events obtained in the foregoing example occurred tar more efficiently in 
the female germline than in the male germline. Fifty-three independent y + progeny (80 total) 
were recovered from 224 female test vials for an overall efficiency of approximately one event 
20 per 4 vials screened. Each vial produced 100-150 progeny, so the absolute rate was 
approximately one independent y + offspring for every 500 gametes. Only three events were 
recovered from 201 male test vials yielding a 16-fold lower efficiency. Because, in 
Drosophila, a meiotic recombination occurs in females but not in males, these results raise the 
question of whether efficient gene targeting relies on the machinery of meiotic recombination. 
25 In other words, does targeted recombination occur in female meiotic cells? Although our 
experiments were not specifically designed to address this question, some evidence on this 
point can be adduced by considering whether the targeting events occur independently or in 
clusters. Meiotic events are expected to be independent, and exhibit a Poisson distribution. 
Events that occur in mitotic cells of me germline can be replicated as cells pass through S 
30 phase and may produce multiple y + progeny from a single event, leading to clustering of the 
recovered y + events. The female germline data differed significantly from a Poisson 
distribution (P< 0.001), exhibiting many more clusters than predicted, suggesting that the 
targeting events occurred pre-meiotically. The non-independent clusters that arose must have 
occurred many mitosis prior to meiosis, because the last four mitotic divisions in females 
35 produce a cohort of cells from which arises a single gamete. 
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is present in the insertion alleles, the stronger hybridization signal exhibited by the upper band 
in lane 6 (Figure 7) suggests that in at least some cases it is from the y gene. The Class III 
events may arise by imprecise initiation or resolution of the recombination reaction. 

The fourth and least frequent class consists of y 1 rescue events resulting from the 
integration of two additional copies of y (Figure 6). Five such events were recovered: four 
were targeted to yellow and produced a triplication of the gene, and one occurred on 
chromosome 3. Although our experiments used flies with only a single donor transgene, when 
a cell is in G2 two copies of the donor will be present. The two copies on sister chromatids 
might dimerize through FLP-mediated unequal sister chromatid exchange [Golic and Lindquist 
(1989) supra], or by end-joining of two independently excised and cut donor molecules. 
Integration of such a aimer could produce the observed results. Although all three bands 
detected with a y probe should hybridize with equal efficiency, the class IV event shown in 
Figure 7 Cane 9) shows a stronger hybridization signal on the 8.0 kb band man on the 10.5 and 
12.5 bands. This particular event may carry yet a fourth copy of y. The remaining four class 
IV recombinants appear to be the simpler events diagramed in Figure 6. 

In these mutation-rescue experiments, the donor DNA was cut in the middle of the 
wild-type rescuing allele. To generate a chromosomal y + gene, recombination that is 
stimulated by the cut must almost inevitably occur with the y l allele. If a single copy of the 
donor were to integrate elsewhere it seems highly unlikely that a functional copy of y + would 
be produced. Thus, our screen practically demands that only integration events targeted to y 
would be detected, and Class I, H, and m events give no information on the relative 
frequencies of targeted events versus random insertions. However, the recovery of Class TV 
events allows us to examine this issue because the middle copy of y + should be functional even 
when the donor molecule integrates, not by recombination with y, but at some other site. Class 
IV events should be recoverable whether targeted to y or not. We recovered five Class IV 
events and four of the five had integrated at the normal location of y on the X chromosome. 
Therefore, even in cases where it was possible to detect integration at sites other than y, the 
majority of recombinants were targeted to y. The single non-targeted Class IV integrant was 
located on chromosome 3 but did not appear (by Southern blotting) to be targeted to the p2t 
gene. 
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Molecular analysis. 

All 56 independent y + lines were analyzed in more detail by Southern blotting. The 
results showed that the 55 X-linked events were the result of targeted recombination at they 
locus. We recovered four classes of targeted events that rescued the / mutation (Figure 6). 
5 The first class consists of simple allelic substitution events that Southern blotting cannot 
distinguish from the original y l allele (Figure 7). These may have been produced by simple 
double crossovers between the donor and y 7 (as diagramed in Figure 6) or by gene conversion. 

The second and equally numerous class is composed of tandem duplications of y, with 
10 the P2t gene located between the two copies. These almost certainly arose by integrative 
recombination between the chromosomal y 1 allele and the cut donor as shown in Figure 6. 
(Molecular data are shown in Figure 7.) 

When the donor element was constructed, the I-Scel cut site was cloned into the SphI 
15 site within the intron of y, destroying the SphI site in the process. Sixteen of the 19 Class II 
alleles had regenerated the SphI sites in both copies of y, demonstrating that the I-Scel 
recognition site can be readily removed during the recombination reaction, and the site 
converted to the sequence of the targeted locus. 

20 The high frequency of Class II tandem duplications suggests another route by which the 

Class I events may have been produced. Recombination between directly repeated y genes at 
a site to the left of the mutation in y 1 would reduce the duplicate genes to a single copy of y + . 
In previous experiments, small tandem duplications that we have generated are very stable (for 
example the P element of Figure IB; also references Golic and lindquist (1989) supra, and 

25 Golic and Golic (1996) Genetics 144:1693]. If Class I events do occur by this route it is likely 
that it immediately follows the integration event when nicks or breaks are still present. As 
Figure 1 shows, tandem duplications are readily lost when a ds break is introduced between 
the duplicate copies. 

30 The third class consists of tandem duplications of y with insertions or deletions of 

material in one of the two copies (Figure 6). These alterations occur about the location at 
which the I-Scel cut site was placed. Although we have not identified the additional DNA that 
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The results demonstrate that randomly inserted transgenes can be converted to targeted 
insertions through the use of a site-specific recombinase and unique site-specific endonuclease. 
The method was quite efficient, allowing targeting events to be identified simply by a genetic 
5 linkage screen, and produced an average of one targeted recombinant for every 4-5 vials 
examined (in females). Our screen detected events that used a donor DNA to convert a mutant 
allele to wild type. The same basic method, modified by the choice of donor construct and 
selection method can be used to generate any desired modification of a target gene even if the 
target gene is known only by the sequence. Essentially any gene of the Drosophila genome 
10 can be targeted, using data from the published Drosophila genome sequence 
rhtto://www.fruitflY.org/.1 It will be apparent to those skilled in the art that the technique 
developed is readily adaptable to targeting any gene or DNA segment whose sequence is 
known. Many of the techniques that have been developed for disrupting genes in yeast are 
adaptable for analogous application in Drosophila [Rothstein (1991) supra] . 

15 

Example 3 : 

The data of Examples 1 and 2 do not rule out the possibility that the targeted gene 
modification observed relied on a type of DNA repair termed Break-induced Replication (BIR). 
Hypothetically, a single one-ended homologous exchange may have occurred, leaving the 

20 recombinant chromosome with a truncated terminus. In order to be recovered as a viable 
product this chromosome with a modified target locus would be repaired by BIR, wherein the 
broken terminus invades the homolog prompting unscheduled replication to the end of the 
chromosome [see, e.g. Engels, W.R. (2000) Science 289:1973]. Since the yellow gene that 
we targeted ties approximately 110 kb from the X chromosome telomere, it is not unreasonable 

25 to imagine that a chromosome break at this location could be repaired by replication to the end 
of the chromosome. Additionally, targeting was much more efficient in the female gennline 
(with two X chromosomes) than the male gennline (with one X), and the BIR model, wherein 
repair of a one-ended recombination event relies on replication templated from a homolog, 
provides an explanation for this difference. Finally, the classes of targeting events that we 

30 recovered could be explained both by homologous recombination, or by a combination of 
homologous exchange and BIR. The significant implication of the foregoing explanation is 
that, if targeting must involve BIR, then it is likely that only genes situated near telomeres can 
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be successfully targeted because of the requirement for continuous replication to Hie end of the 
chromosome. Thus , it is useful to know whether me technique of the invention can be applied 
broadly, or whether it will be limited to genes near telomeres. 

Straightforward homologous recombination is a more parsimonious explanation for our 
data. In considering the hypothesis that the gene targeting described in Examples 1 and 2 relies 
on BIR, and secondarily on the presence of a homolog, one cannot overlook the fact that 
genuine targeting events, although small in number, were recovered from males. These males 
of course have but a single X chromosome. Furthermore, if a one-ended homologous 
recombination event can occur there is no obvious reason why two-ended events should not 
occur. The following experiment was performed to test the foregoing hypothesis. Data from 
the experiment, described herein, demonstrate that we have generated a targeted knockout of 
a gene mat is very tar removed from telomeres. Consequently, the hypothesis just described 
does not account for the observed results and the method of the invention has been shown to 
be broadly applicable for any target gene. 

The pugilist (pug) gene encodes a homolog of the trirunctional form of the enzyme 
methylene tetrahydrofolate dehydrogenase, and animals carrying mutations in this gene show 
eye color defects [Rongetal. (1998) Genetics 150:1551]. The gene is located at 86C on the 
right arm of chromosome 3 approximately 20 Mbp from the nearest telomere. A 2.5 kb 
fragment of the gene was engineered lacking the first, and part of the fourth and fifth exons, 
by inserting a recognition site for I-Scel endonuclease at an Apal site in exon 4, and placed it 
into the P element vector P[> W>] [Golic et al. (1989) CeU 59:499]. In this vector, the 
engineered pug fragment and vf are flanked by direct repeats of the FLP Recombination 
Target (FRT). Transformants were generated and crossed to produce flies that carry 70FLP, 
701-SceI and the pug donor construct. We heat-shocked these flies as described herein [see 
also Rong et al. (2000) Science 288:2013 incorporated herein by reference in its entirety] and 
carried out a segregation screen to look for mobilization of the marker gene to a different 
chromosome. From 455 female vials we recovered 3 independent cases of W mobilization. 
Two of me events were instances of pug knockout produced by targeted recombination 
between the donor DNA and the resident^ gene (Figure 14). The pug allele at the left (J' 
pug a) carries a deletion which includes part of exon 4, exon 5 and 3' UTR of pug. The pug 
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allele at die right (5' pug a) lacks the promoter and exon 1 of pug. Three criteria support this 
conclusion: Southern blotting (Figure 15) showed bands of the sizes expected for a Class II 
targeting event [Rong et al . (2000) supra] ; in situ hybridization showed that the W° gene was 
now located at 86C; and the targeted alleles exhibited the pug null phenotype. The remaining 
event was an integration at a site other than pug and was not examined further. 

The results of the pug targeting experiment do not rule out the possibility that some of 
the targeting events we previously reported at yellow did arise by homologous recombination 
and BDL The explanation for the difference in targeting efficiency between pug and yellow is 
most likely due to the different amounts of donor:target homology in the two experiments — 
814) in the yellow experiments vs. 2.5 kb in the pug targeting experiments reported here. 

The results of the pug targeting experiment also show that non-targeted insertions, 
although they do occur, are not so frequent as to be a significant nuisance. Here, the targeted 
recombinants outnumbered the non-targeted recombinants by 2:1. If targeting efficiency is 
improved, for example by increasing donontarget homology, then non-targeted events would 
constitute an even smaller portion of events detected by the segregation screen. Tending to 
confirm this supposition, in the yellow targeting experiments a majority of the informative 
Class IV events were a result of targeted recombination [Rong et al. (2000) supra]. 

Most importantly, the results presented here demonstrate that non-telomeric genes can 
be targeted and modified by homologous recombination, and this can be done solely by 
following the inheritance of an arbitrary marker gene. 

Example 4 : 

Another embodiment of the method for targeted mutagenesis is diagramed in Figure 
8. A fragment of the gene to be mutated has an I-Scel or other unique endonuclease cut site 
placed within it. This donor DNA and a marker gene is placed between FRTs and then into 
a transposon vector for transformation. After induction of FLP and I-Scel in females, targeting 
events can be detected by altered linkage of the marker gene, and verified by genetic or 
molecular techniques. As we have shown in our screen the targeted events outnumbered non- 
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targeted events. Thus, it will be relatively easy to recover the desired recombinants. In the 
example of Figure 8, a Class II integration event produces two truncated mutant alleles. 

Many of the targeted events that we recovered in the first described embodiment were 
5 not produced by precise recombination. The Class HI events had alterations in the targeted 
locus that would not be predicted by homologous exchange. Some of the Class n events may 
also have very small alterations that were not detectable by Southern blotting. It is also likely 
that there were many additional Class in targeted events tot were not recovered in our screen 
because they carried deletions that destroyed the y + locus. So, although gene targeting often 

10 resulted from precise recombination there are also many imprecise and potentially mutagenic 
events. It follows that is it not necessary that the donor construct carry a mutant form of the 
target locus (such as the truncated gene of Figure 8). Mutant alleles can be produced at a 
reasonable rate simply by imprecise targeting events. Such a result has precedence in die 
examination of stably transformed Drosophila cell lines. Cherbas and Cherbas [(1997) 

15 Genetics 145:349] observed that in many cases, DNA transfected into cell lines had integrated 
near the chromosomal locus with homology to that DNA, and that rearrangements were often 
produced that in some cases generated mutations of the chromosomal locus. They termed the 
phenomenon parahomologous targeting and it may be closely related to the processes that are 
responsible for the Class III events that we recovered. 

20 

As previously described, an I-Crel cut site may also be introduced, which allow the 
reduction of class IH alleles to a single copy mutant allele. 

The invention makes it possible to introduce point mutations and a variety of other 
25 changes. Moreover, the not infrequent occurrence of Class I events indicates that it is feasible 
to produce allelic substitutions at other loci. Finally, the frequent replacement of die I-^Scel 
cut site sequences at die termini of the donor with the wild-type genomic sequence indicates 
that it is feasible to carry out targeting with an I-Scel cut site placed within a gene's coding 
sequence, and yet not necessarily destroy that portion of the gene. 

30 
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^Example 5: 

The procedures of Examples 1 and 2 were modified in two ways to adapt the invention 
to plants. First, we used the Cre/Lox recombination system in place of the FLP/FRT 
recombination system* The Cre/Lox system was utilized since prior studies in die laboratory 
5 made the starting constructs immediately available. The Cre/Lox system has been 
demonstrated to work well in plants [Sieburth, Drews and Meyerowitz (1998) Development 
125:4303]. The FLP-FRT system, however, can work equally well according to the literature. 
Second, we utilized plant specific promoters to drive expression of the Cre and I-Scel genes 
(discussed below). The gene targeting is described for Arabidopsis because its short generation 
10 time, ease of transformation, and small genome make it a convenient model for gene targeting 
in plants. 

In adapting the method of the invention to plants (as to any organism) aspects of the 
organisms biology should be taken into account. Specifically, plants have a different pattern 

15 of development from animals which affects the developmental stage when homologous 
recombination is most likely to occur. The most important difference is that plants lack a 
"germ line" in the sense of an animal germ line. In animals, a specific set of cells (the germ 
line cells) is set aside early in development to become the germ cells. In plants, no such event 
occurs. Plants develop via meristem growth. The shoot apical meristem at the tip of the plant 

20 contains a group of rapicUy-dividing cells that give rise to die entire above-ground portion of 
the plant (i.e. , the entire shoot) including the flowers. At a specific time of development, the 
shoot apical meristem gives rise to floral primordia. Floral primordia develop into flowers 
containing four organ types: sepals, petals, stamens, and carpels. Inside the stamens and 
carpels are produced the microspore mother cells and megaspore mother cells, respectively. 

25 The mother cells undergo meiosis to produce haploid microspores and megaspores, which 
develop into the haploid male and female gametophytes that contain the sperm and egg cells, 
respectively. 

Thus, for an homologous recombination event to be transmitted to the following 
30 generation, it is preferred to express the Cre Recombinase and I-Scel enzymes in one of the 
following patterns: (1) the zygote, (2) the embryo cells that give rise to the shoot apical 
meristem, (3) the portion of the shoot apical meristem that gives rise to the germ cells (the L2 
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layer in most species), (4) the cells of a developing flower that give rise to the mother cells, 
(5) the mother cells, (6) the developing gametophytes, (7) the egg and/or sperm, or (8) cultured 
cells. 

5 A convenient place to induce homologous recombination is in the mother cells that give 

rise to the germ cells. First, homologous recombination occurs at elevated frequency in cells 
undergoing meiosis because this is die time when meiotic homologous recombination normally 
occurs. Therefore, the enzymes needed to carry out the process are clearly present and 
functional in these cells. Second, because each mother cell gives rise to a different gamete, 
10 each mother cell represents an independent "attempt" at homologous recombination. Finally, 
each plant produces thousands of mother cells; thus, thousands of homologous recombination 
"attempts" occur in each plant. 

By contrast, gene targeting by homologous recombination in the shoot apical meristem 
15 is likely to occur at a lower frequency, but may still be used in the invention. The shoot apical 
meristem cells divide rapidly and are less likely to contain the enzymes required to undergo 
homologous recombination. 

Two promotes were used to drive expression of the Cre Recombinase and I-Scel genes 
20 in. Arabidopsis. The first is the promoter from the Arabidopsis AtDMCl gene [Klimyuk and 
Jones (1997) Plant Journal 11:1-14]. This promoter directs expression to die pollen mother 
cells and megaspore mother cells. As described above, directing expression of the Cre and I- 
Scel genes to the mother cells has several advantages. The second promoter used is the 
promoter from the Arabidopsis HSP 18.2 heat shock gene [Takahashi and Komeda (1989) MoL 
25 Gen. Genet 219:365-372]. This promoter provides inducible expression in Arabidopsis, 
which is convenient for testing various developmental stages for effectiveness of obtaining 
homologous recombination. This promoter has been used to drive expression of the Cre 
Recombinase gene in Arabidopsis [Sieburth et al. (1998) Development 125:4303-4312] . Four 
enzyme constructs were made as summarized in the table below: 

30 
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Construct Name 


Promoter 


Gene 


DMCl::Cre 


AtDMCl 


Cre 


DMCl::KceI 


AtDMCl 


I-Scel 


HS::Cre 


HSP18.2 


Cre 


HS::ISceI 


HSP 18.2 


I-Scel 



HS = heat shock promoter 
DMC1 = AtDMCl promoter 



la addition to the above, other promoters can be utilized, for example, other useful 
promoters include LEC1 (lotan et al. (1998) Cell 93, 1195-1205), which confers expression 
in the zygote and early embryo; the CaMV 35S promoter, which confers somewhat constitutive 
expression and will induce homologous recombination in the cells that give rise to the shoot 
apical meristem, and the SHOOT MERISTEMLESS (Long et al., (1996) Nature 401, 769-777) 
and OLAVATA3 (Fletcher et al. (1999) Science 283, 1911) promoters that will drive expression 
in the L2 layer of the shoot apical meristem. A preferred promoter is one that can drive 
expression in the L2 layer, which contains the shoot apical meristem cells that give rise to 
germ cells. Candidates include STM, CLV1, CLV2, CLV3. 

The present example employs gene targeting to convert a mutant allele into a wild-type 
allele. This approach obviates die need to include a complex selection strategy. The targeting 
is demonstrated with two genes that have well-defined and easily-scored mutant phenotypes, 
and that are transformable at high frequency. The genes are the Arabidopsis CRABS CLAW1 
(CRC1) gene [Bowman and Smyth (1999) Development 126:2387-2396] and the Arabidopsis 
CLAVATA1 (CLV1) gene [Clark et al. (1997) Cell 89:575-585]. Donor constructs include 
a wild-type copy of the gene with an I-Scel site in an exon flanked by loxP sequences. We 
have made two donor constructs as summarized in the table below: 
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Construct Name 


Gene 


CRC1-D 


CRC1 


CLV1-D 


CLV1 



The general structure of the donor construct is as follows: 



LB 


(-) SMGene 


loxP 


(+) SM Gene ! 


TGMS 


I 


TGMS 


loxP 


RB 



LB = left border of T-DNA 

(-)SM Gene = negative selectable marker gene (optional) 

(4-)SM Gene = positive selectable marker gene (optional) 

TGMS = target gene modifying sequence 

I = I-Scel site within the target gene modifying sequence 

RB = right border 
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While this example describes a method of converting a mutant allele to a wild-type 
allele, other types of conversions are within the scope of the invention. One such conversion 
involves the converting a wild-type allele to mutant allele, which can in certain instances 

1 5 involve the use of selection schemes to recover organisms in which the targeting has occurred. 
Such selection schemes can advantageously employ selectable markers. The negative 
selectable marker gene used herein is the E. coli codA (cytosine deaminase) gene [Mullen et 
al. (1982) PNAS 89:33-37; Mullen and Blaese (1994) U.S. Patent No. 4,975,278; Stougaard 
(1993) Plant Journal 3:755-761; Serino and Maliga (1997) Plant Journal 12:697-701]. A 

20 variety of other negative selectable marker genes are available including the Agrobacterium 
tms2 gene [Depicker et al. (1998) Plant Cell Rep. 7:63-66] the nitrate reductase gene 
[Nussaume et al. (1991) Plant Journal 1:267-274], and the alcohol dehydrogenase gene. The 
positive selectable marker gene used herein is the neomycin phosphotransferase gene, which 
confers resistance to fcanamycin [Fraley et al. (1998) PNAS 80:4803-4807]. Many other 

25 positive selectable marker genes are available and known to those of ordinary skill in the art. 

Various modifications to the foregoing procedure can be introduced to simplify and 
streamline the process. The number of generations to obtain a homozygous mutant can be 
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reduced by instituting two changes. The first is to introduce the donor constructs into a carrier 
host, a plant strain that already has been transformed with the enzyme constructs. This change 
will decrease the number of generations to three. The second change is to utilize promoters 
to drive expression of the Cre Recombinase and I-Scel genes very early during embryo 
development, ideally in the egg cell of zygote. The combination of changes reduces the 
number of generations to two. 

The time required to make donor constructs can be reduced by constructing a cloning 
vector to simplify cloning the target modifying sequence. The modifying sequence cloning site 
(CS) contains an I-Scel site flanked by two sites for target modifying sequence cloning (Tm-L, 
left TM cloning site; TM-R, right TM cloning site). It also has a multiple cloning site (MCS) 
containing several unique restriction sites. 



LB 


(-)SM Gene 


loxP 


(+) SM Gene 


CS 


loxP 


RB 



CS = target gene cloning site = 



TM-L 


I 


TM-R 


MCS 



In addition to the above, it is possible to induce homologous recombination at the 
moment of T-DNA integration. With in planta transformation, it is thought that it is the egg 
cell that becomes transformed. The donor construct is introduced into a plant strain expressing 
the Cre recombinase and I-Scel endonuclease genes in the egg cell. Doing so confers the 
advantages of saving one generation of time to obtain a plant homozygous for the gene 
modification. 

It is also possible to use a transposon to excise the target gene. This obviates the need 
for using the Cre-fox or Flp-FRT system to do so. The transposase and I-Scel endonuclease 
are expressed at the same time. The transposase excises the transposon and then I-Scel 
endonuclease cuts at the I-Scel sites. These cuts create the same situation that is obtainable 
with the Cre-foc or Flp-FRT system (see Fig. 17). Again, it can be advantageous to express 
transposase/I-Scel in the mother cells, just before or during meiosis. 
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Introducing the Constructs into the Arabidopsis Genome: 

We have introduced all constructs into the Arabidopsis genome u$ing Agrobacterium- 
mediated transformation. Each construct was assembled in an is. coli plasmid vector 
(pBluescript or other) and then ligated into the pCGN1547 Binary Ti-plasmid transformation 
5 vector (McBride and Summerfelt (1990) Plant Molecular Biology 14:269-276]. The 
pCGN1547 clone was first introduced into E. coli and then into Agrobacterium strain ASE. 
Agrobacterium strains containing the various constructs were used to infect mutant (civ 
mutants or crcl mutants) Arabidopsis plants using in planta transformation [Chang et al. 
(1994) Plant Journal 5:551-558; Bechtold et al. (1993) C.RAcad. ScL Paris Life Sci. 

10 32*1194-1199; Clough and Bent (1998) Plant Journal 16:735-743; Katavic et al. (1994) Mol 
Gen. Genet. 245:363-370]. In this procedure, Arabidopsis plants are dipped in an 
Agrobacterium solution and the plant reproductive tissues become invaded by the bacteria. 
Optimal heat shock conditions may vary from strain to strain. Testing and determination of 
heat shock conditions can be performed by one of ordinary skill in the art. It is thought that 

15 the egg cell becomes transformed [Ye et al. (1999) Plant Journal 19:249-257; Bechtold et al. 
(2000) Genetics 155:1875-1997]. Transformed strains were selected for kanamycin resistance. 
Using this procedure, we have generated six Arabidopsis strains: 



Strain Name 


Genetic Background 


Introduced Construct 


clvl-HSE 


clvl 


HS::CreandHS::ISceI 


crcl-JSE 


crcl 


HS::CreandHS::ISceI 


clvl-DCME 


clvl 


DMCl::Cre and DMCl::ISceI 


crcl-DCME 


crcl 


DMCl::Cre and DMCl::ESceI 


clvl-D 


clvl 


CLV1 Donor Construct 


crcl-D 


crcl 


CRC1 Donor Construct 



20 HS = heat shock promoter 

DMC1 = AtDMCl promoter 
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These strains were grown and crosses were carried out to bring together the enzyme 
constructs and donor constructs. Specifically, the following crosses were carried out; 

(1) Strain clvl-HSEx Strain clvl-D; 

(2) Strain clvl-DCME x Strain clvl-D; 

(3) Strain crcl-HSE x Strain crcl-D; and 

(4) Strain crcl-DCME x Strain crcl-D. 

Inducing Recombinase and Endonuclease Enzyme Expression: 

In the strains harboring the heat shock promoter-enzyme constructs, induction is carried 
out by immersion in warm water as described by Sieburth et al. (1998) Development 125:430- 
4313. Heat induction is carried out at a variety of developmental stages including developing 
embryos (to induce in the cells dial give rise to the shoot apical meristem), the tips of floral 
stems (to induce in the cells of the shoot apical meristem) , developing flowers (to induce in the 
cells that give rise to the mother cells), flowers undergoing meiosis (to induce in the mother 
cells), and mature flowers (to induce in the germ cells). 

In the strains harboring the DMC1 promoter-enzyme constructs, expression is not 
externally induced. As described above, the developmentally-regulated promoter induces 
expression of the enzymes at a time just before meiosis. 

Identifying Plants in which HR has occurred: 

Plants that have been induced are allowed to undergo self-pollination and progeny seed 
are collected. The progeny seed are grown and scored for the mutant phenotype. Plants in 
which targeting has occurred are wild-type. Genotype is verified using PCR. 

Example 6: 

Ends-out targeting in some instances may be preferable to ends-in targeting. It can 
simplify the construction of the donor element and provide a faster and simpler route to the 
generation of deletions with precise endpoinls. These deletions can also cany a dominant marker 
gene which can simplify their use in subsequent crosses. 
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Targeting yellow by ends-out methods 

The efficiency of ends-out targeting can be measured wiih yellow. The donor element 
is constructed by placing two I-Scel cut sites into the polylinker of the P vector pw8 and then 
cloning the 8 kb y+ fragment between those sites. After transformation and crossing to 701-SceI 
flies, I-Scel expression in the oflfepring is induced by heat shock. A linear DNA fragment 
comprising thej> + gene is freed by double-cutting with I-SceL See Figure 19. Hie heat-shocked 
flies are then mated and screened for progeny that are y+, but not w + . These can arise from 
targeted recombinants at yellow or non-targeted insertions elsewhere in the genome. It is also 
possible to lose w + function from within the P element by single cutting near w^ and loss of part 
of the wte gene to exonucleolytic digestion. Therefore, it is required that the j/ 4 " w - events map 
to a different chromosome to be demonstrative examples of>>+ mobilization. The structures of 
any y+ genes that map to the X chromosome (potential targeting events) are characterized by 
Southern blotting. 

This event relies on two I-Scel cuts rather than a single cut Since the efficiency of 
single-cutting is approximately 90% for a single I-Scel site following heat-shock induction of 
701-SceI, it is estimated that -80% of the cells experience a double cut An independent estimate 
of the efficiency of double-cutting can be provided by scoring the frequency of complete yellow 
gene loss that arises from the double cut with this ends-out construct The frequency of double- 
cutting can be increased by using two or more copies of 70ISceL 

The ends-in targeting scheme of Examples 1 and 2 allows for repair of an I-Scel cut by 
FLP-mediated recombination, either before (in which case the cut occurs on an 
extrachromosomal molecule) or after scission. The described ends-out construct provides no 
such built-in mechanism to restore the cut chromosome, so that cell death might occur in some 
instances. Cell death is unlikely for the following reasons: first, when an unrepairable 
chromosome break is generated by breakage of a dicentric chromosome (because only a single 
broken end is present), the result in the soma is cell death [Ahmad and Golic (1999) Genetics 
151:1041-51]; second, following I-Scel expression in flies carrying a single cut site, little or no 
cell death is observed. Thus, the chromosome from which the donor is excised is likely to be 
repaired 
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Alternatively, a new version of the donor in which the I-Scel site-flanked yellow+ gene 
is also flanked by FRTs can be used. This construct can be used for ends-out targeting using I- 
Scel and FLP expression together. When FLP acts first, it will excise the donor, leaving behind 
an intact chromosome. The donor can then be cut by I-Scel. 

5 

Precise deletion of yeUow+ can be generated using a replacement strategy. Upstream and 
downstream regions of yellow are cloned to flank a whs gene and I-Crel recognition site, and this 
assembly placed between I-Scel sites. 

10 After transformation, a segregation screen for mobilization of w^s to the chromosome 

in a y+ w background is performed. A targeted recombination event results in the precise 
deletion of yellow and insertion of m its place (Figure 20). Recombinant products can be 
characterized by Southern blotting. 

15 Serial substitution 

One use of ends-out targeting in yeast is to first insert a marker gene into the target locus 
and, in a second step, replace that marker with an altered allele of the gene in question, foUowed 
by screening (or selecting) for loss of the marker. A similar scheme can be carried out in flies 
by making use of the I-Crel cut site that was included next to w^. Cutting at this site can 
20 stimulate replacement of the marker with sequences from a donor template by gene 
conversion. 

Replacement of the gene is accomplished at the yellow locus by exchanging it for a 
modified y+ allele. The y gene missing part of the intron, deluding the tarsal enhancer, includes 

25 y flanking regions to provide the homology for exchange. The crosses can be carried out with 
a variety of yellow alleles on the homolog (mcluding deletions or y+ alleles) by distinguishing 
homolog-templated events from those that use the introduced gene as a template. The molecular 
stractures of white loss events that are yellow (possibly resulting form gap enlargement and end- 
joining or incomplete gene conversion) oi yellow+ (resulting from templated gene conversion) 

30 can be examined. 
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**m**Boy mm Pro* MAca* Set ^ 8*1735-9] and GWe.a,.^ 
^^^^^^^^ > 

conversion. Uus, alt^y, Section of a helper I-Cre! gene or Kiel mRNA ^ 
t^togenerama^bletmosfbnnati mt^^^^^^^^ 
g-exmveraion. tefcW-tt,,.^^ 
by tage drmc repeat as « an erais-in mrgetag ^ ^ „ „, ^ ^ 
P-^fcrelinU.an.g^byi^o,^^ 
conversion may constitute a large fraction of all events thatlose„fe. 

^'^rf«^rfWAto M bed^by 1 he«^ ta g e(bgcM1|)e 
*~nsmg tlK ^7«lcciasadiagr^c.es t Tlesegenesarcpreseatin^ehstem 
S7A«d87C«d S pan«H,a nd5 0 B , Wonese^to^lena^n^of^^ 
be used for targeting. Alternatively, antosomal facets can be chosen. 
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Implenumbmon of poauve-negative select can be «| to eliminate non-targeted 
of events in dnosophila 

zu a marker gene from one chromosome to another. 

Elimination of mapping and marking steps as prerequisite for targeting. 

More specificaUy, the signal for a targeting event is mobilization of the donor from a 

™™^y^^of^ matest ^ Theneedformnppfogandmarldng 
the donor element-bearing chromosome causes a substantial time delay for producing a fly with 
amodifredtargetgene. ^V^^^oUs^,^^^.^^ 
element insertion and a Class H targeting event, the procedure can b. shortened signifrcantly 
For example, in a transformed copy of TV2, the targeting conduct and the ^ are flanked by 
FRTs (see Figure 20 for the structure of the targeting vector), fo a class H (or ffl or IV) targeting 
event, there is a copy of ^ that is not flanked by FRTs. Ihe mosaicism, or lack thereof that 



56 



WO 01/66717 PCT/US01/07051 
is produced by FLP can be used as a criterion for distinguishing flies with the original TV2 
insertion from flies with a targeting event (see Figure 22). 



Flies that carry 70FLP, 70ISceImA the targeting construct are heat-shocked and crossed 
5 to flies that are homozygous for an insertion of 70FLP that show a high degree of expression 
without heat shock (see Figure 23 for crossing scheme). Most progeny are entirely white-eyed 
owing to excision and loss of the donor construct carrying a gene. Some progeny with eye 
pigment can arise from the infrequent failure of excision; these appear as mosaics owing to FLP 
expressed from the constitutive 70FLP transgene. Targeting events produce progeny with solidly 
10 pigmented eyes (as does non-targeted insertion). Targeting is verified by a backcross to the 
constitutive 70FLP strain; progeny with a lack of mosaicism are characterized by Southern 
blotting to confirm that they were produced from the expected targeting events. 

Due to die efficiency of FLP-mediated excision, the number of false positives can be very 
15 low. This screen requires the same number of generations as the original segregation screen, but 
the step requiring mapping, marking, and making of stock transformants is completely eliminated 
as a prerequisite for targeting, and saves about six weeks in the overall process. 

According to this scheme, the targeting events can be recognized in cis. During P- 
20 induced gap repair and gene conversion, ectopic templates in cis are used more efficiently than 
templates on other chromosomes. The targeting efficiencies with donors in cis and in trans to 
the target locus are compared to determine the effects on efficiency. 

It can be desirable to map die original transformant, and possibly keep it as a stock in 
25 case the targeting crosses were unsuccessful and needed repeating. But these steps can be carried 
out in tandem with the targeting screen. The main purpose of mapping is that, after targeting, 
the original (now unmarked) insertion of TV2 can be crossed out FLP and Z-iSbe/ elements can 
also be crossed out The process can be simplified by choosing FLP and IScel insertions that are 
not on the target chromosome. Once a suitable targeting event is recovered, there is no longer 
30 a need to keep the original insertion. 
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Development of a marker segregation vector 

An alternative scheme involves generating a vector that carries two markers to visualize 
segregation of the original P element insertion and the targeting molecule. This vector has a 
structure similar to that of pTV2 (the plasmid clone of the TV2 vector) between the FRTs and 
can carry a second dominant marker outside the FRTs. The scheme to detect targeting relies on 
the dominant marker, which is included in the construct Eye color markers are not well-suited 
to this' scheme, but a reasonably good marker is the hybrid GMR-P35 gene [Hay et al.(1994) 
Development 120:2121-29]. This construct expresses the baculovirus P35 protein in the eye 
posterior to the morphogenetic furrow. The result is a moderate disorganization and roughening 
of the eye. After synthesis of FLP and I-Scel, targeting events are detected as progeny that are 
but without rough eyes. 

The present invention is not to be limited in scope by the specific embodiments 
described herein. The described embodiments are intended to be illustrative of individual ways 
that general aspects of the invention and functionally equivalent methods and components 
operate within the scope of the invention, including methods and components known in the art, 
whether or not they are specifically described or listed herein. Various modifications of the 
invention, in addition to those shown or described herein, will become apparent to those skilled 
in the art from the foregoing description and accompanying figures. Such modifications are 
intended to fall within the scope of the appended claims. 
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1 . A method of gene targeting in a transformable host organism comprising: 

choosing a target gene of the host organism or portion thereof having known or 
cloned sequence, 

transforming the host organism to contain an expressible gene encoding a 
unique endonuclease, 

transforming the host organism to contain an excisable donor construct having 
a segment of sequence homologous to the target gene or portion thereof, the segment 
having a unique endonuclease site or sites inserted therein or adjacent to, 

excising the donor construct and expressing the unique endonuclease, whereby 
a recombinogenic donor is produced, and 

selecting for progeny of the host organism wherein recombination between the 
target and the recombinogenic donor has occurred. 

2. The method of claim 1 wherein the endonuclease is expressed under control of an 
inducible promoter. 

3. The method of claim 1 wherein the endonuclease is expressed under control of a tissue- 
specific promoter. 

4. The method of claim 1 wherein the endonuclease is expressed under control of a 
ubiquitous, constitutive, or development stage-specific promoter. 

5. The method of claim 3 wherein the promoter is a heat shock promoter. 

6 . The method of claim 3 wherein the promoter is inducible by the presence of a specified 
substance. 

7. The method of claim 1 wherein the host organism is a multicellular organism or a 
single-celled organism. 
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«• The method of claim 7 wherein th» 

xaxui wnerein the host organism is an insect. 

9- The method of claim 8 wherein the insect is a * . 

- ulsccr 15 a member of an insect order ^i^w^ * 

the group Coleoptera, Diptera, Hemiptera, Homo*™ * ^ 
0*%*er«. ****** or 

10. The method of claim 9 wherein «v • 

wnerem the insect is a member of the order Z>^r* 

12- ^^^10^ fctafc . — 

13- The method of claim 1 wherein the host organism is a plant. 

14. The method of claim 13 wherein the plant is a monocot. 

15. The method of claim 14 wherein the plant is selected fin™ «, 

maize, rice or wheat. selected from the group consisting of 

16. The method of claim 13 wherein the plant is a ^ 

17. The method of claim 16 wherein the plant is selected from * 

potato soybean tan™*, u ^ the group consisting of 

soybean, tomato, members of ^Brassica family, or^tf^. 

18- The method of claim 13 wherein the plant is 

19. Tbe method of claim 1 wherein the host organism is a mammal. 

20. The method of claim 19 wherein the mammal is selected from the eroun 
-ouse, rat, pig, sheep, bovine, dogorcat. 

21- ^^of^!,^^,,^^^ 
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22. 

chicken, tnrfcey, duck or goose. 

23. ^n^ofelaimlwhetetottalKWo.xaafemfaafish. 

24. ^'^ofo I ata23whe re m« Kflshisaze ^^ OT ^ 

25. ^^ofc^,^^^^^^^^^ 
oneaed wim reapec, » eBdonuctea^ * to pmWde ends-fa reeofflbtaafioo. 

26. ^^ofeUta^^ aK ^ rcom ^ ls>targetgMemodi ^ 
recombination. 

27. ^^^l^d^e^to^i,^^^^^^ 



2*. ^^^27^^,,^^^^^ 
of I-Seel, I-THI, I-Ceul, I-Ppol, I^M .orPM-sp,. 

29. ^^ofelatalw^fteexdaabfeteeons^oc^apairof 
lecombfaase recognition sites flaoking a segment of DNA comprising the segment of 
sequence hotnotogous to the targe, gene, and to host cell cofaafas a gene encoding a 
recombinase specific for said recombfaase recognition sites. 

30. ^^of<^29wi^to rec ^ imMis ^ te ^ oiicomoIofaii 
uafacMe promrter fa me bos. ceO, and «be sfcp „, excising to donor consort 
comprises inducing the recombinase. 

31. TTtemedtodofctoMwh^to^te^.^^,^^^^ 
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32. The method of claim 30 wherein the inducible promoter is induced by the presence of 
a specified substance. 

33. The method of claim 29 wherein the recombinase is under expression control of a 
tissue-specific promoter. 

34. The method of claim 29, wherein the recombinase is under expression control of a 
development stage-specific promoter, a ubiquitous promoter, mRNA encoding 
recombinase, or recombinase protein. 

35. The method of claim 29 wherein the recombinase and its specific recognition site, 
respectively, are selected from the group consisting of Cre and lox or Hp and FRT. 

36. The method of claim 1 wherein the excisable donor construct comprises a pair of 
transposase recognition sites flanking a segment of DNA comprising the segment of 
sequence homologous to the target gene and the host cell contains a gene encoding the 
transposase specific for said transposase recognition sites. 

37. The method of claim 1 wherein the excisable donor construct comprises DNA encoding 
one or more selectable markers. 

38. The method of claim 37 wherein the selectable marker provides positive selection for 
cells expressing the marker. x 

39. The method of claim 37 wherein the selectable marker provides negative selection 
against cells expressing the marker. 

40. The method of claim 37 wherein the selectable markers provide positive and negative 
selection of cells expressing the markers. 
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41 . The melhod of claim 1 wherein the excisable donor construct comprises DNA encoding 
a screenable marker. 



42. The method of claim 41 wherein the marker is selected from the group consisting of 
beta-glucuronidase, green fluorescent protein or luciferase. 

43. The method of claim 1 wherein the step of transforming the host organism includes 
transforming a germ line cell of the host organism. 

44. The method of claim 1 wherein the step of transforming the host organism consists 
essentially of transforming a somatic cell of the host organism. 

45. A transformation vector comprising a target gene modifying sequence, the m<xlifying 
sequence being homologous with a specified target gene or portion thereof, and having 
a unique endonuclease site inserted within the modifying sequence dividing said 
sequence into a first segment and a second segment. 

46. The vector of claim 45 wherein the unique endonuclease site is selected from the group 
consisting of I-Scel, I-Tlil, I-Ceul, i-Ppoi 0 r PI-PspI. 

47. The vector of claim 45 wherein the first and second segments of the target gene 
modifying sequence are in parallel orientation with one another, whereby the vector is 
adapted for ends-in recombination. 



48. 



49. 



The vector of claim 45 wherein the first and second segments of the target gene 
modifying sequence are in anti-parallel orientation with one another, whereby the 
vector is adapted for ends-out recombination. 

The vector of claim 45 wherein the first and second segments of the target gene 
m<)difying sequence are in parallel orientation with one another, whereby the vector is 
adapted for ends-out recombination. 
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50. ^^«^43<U^ eaa ^t^^ 

51- The vector of claim 50 wherein the marker gene encode., on. n r 

markjers 86116 encodes ^ or more selectable 



52. ^veo^ofcUtojO^^^^^^^ 
Toe ve«or ofeW* „ ^ fc ^ ^ ^ 

54. ^"^50^^^^^^^^ 

55. ^vec t orofclai m 50 w te re mtt K 8e m <^a S c K e B abte« rait . 
55. ^J~^55w^ te ^ te ^ ksdKtedliomte 

57. ^J^o f c to4 5 itarteco ^ a ^ oft ^ tase 
flanbng a aegrnen. „f DNA prising the .eg^on, „, ae^ence hMih ^ 

recombinase recognition sites. 

58. A ^ofgen etogafagtaa ^^ te ^ o ^ c ^ ii|g 

cloned sequence, r 

^^fcnoatorgadamtoc^ane^ihfe^ 

unique endonuclease, 

of sequence hontoiog™ to the target gene or portion there* , the ^ having a 
unique endonuclease site inserted therein, 

expressing the unique endonuclease, whereby a recombinogenic donor is 
produced, and 
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selecting for progeny of the host organism wherein recombination between the 
target and the recdmbinogenic donor has occurred. 

59. The method of claim 58 wherein the endonuclease is expressed under control of an 
inducible promoter. 



60. The method of claim 58 wherein the endonuclease is expressed under control of a 
tissue-specific promoter. 

61. The method of claim 58 wherein the endonuclease is expressed under control of a 
development stage-specific promoter. 

62. The method of claim 60 wherein the promoter is a heat shock promoter. 

63 . The method of claim 60 wherein the promoter is inducible by the presence of a 
specified substance, an ubiquitous promoter, mRNA, or a protein. 

64. Themethod of claim 58 wherein the host organism is a multiceUular organism or a 
single-celled organism. 

65. The method of claim 64 wherein the host organism is an insect. 

66. The method of claim 64 wherein the insect is a member of an insect order selected from 
the group Coleoptera, Diptera, Hemiptera, Homoptera, Hymenoptera, Lepidoptera, or 
Orthoptera. 

67. The method of claim 66 wherein the insect is a member of the order Diptera. 

68. Themem()dofclami67wheremmemsectisafruftfly^ 

69. The method of claim 67 wherein the insect is a mosquito or a medfiy. 
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70. The method of claim 58 wherein the host organism is a plant. 

71 . The method of claim 70 wherein the plant is a monocot. 

72. The method of claim 71 wh^ 
maize, rice or wheat 

*. 

73. The method of claim 70 wherein the plant is a dicot 

74. Tl* memc* of claim 73 w^ 

potato, soybean, tomato, members of the Brassica family, or Arabidopsis. 

75. The method of claim 70 wherein the plant is a tree. 

76. The method of claim 58 wherein the host organism is a mammal. 

77. The method of claim 76 wherein the mammal is selected from the group consisting of 
mouse, rat, pig, sheep, bovine, dog or cat. 

78. The method of claim 58 wherein the host organism is a bird. 

79. The method of claim 78 wherein the bird is selected from the group consisting of 
chicken, turkey, duck or goose. 

80. The melhod of claim 58 wherein the host organism is a fish. 

81. The method of claim 80 wherein the fish is a zebrafish, trout, or salmon. 

82. The method of claim 58 wherein the donor construct is a target gene modifying 
sequence oriented with respect to the endonuclease site to provide ends-in 
recombination. 
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recombination. 



84. The method of claim 58 wherein the endonuclease is selected from the group listing 
of rare-cutting endonucleases. 

85. The method of claim 84 wherein the erdonuclease is selected from the group consisting 
of I-Scel, I-TUI, I-Crel, R^m, i,pp oI or PI-PspI. 

86. The method of claim 58 wherein the donor construct comprises DNA encoding one or 
more selectable markers. 

87. The method of claim 86 wherein the selectable marker provides positive selection for 
cells expressing the marker. 

88. The method of claim 86 wherein the selectable marker provides negative selection 
against cells expressing the marker. 

89. The method of claim 86 wherein the selectable marker provides positive and negative 
selection for cells expressing the marker. 

90. The method of claim 58 wherein the donor construct comprises DNA encoding a 
screenable marker. 

91. The method of claim 90 wherein the marker is selected from the group consisting of 
beta-glucuronidase, green fluorescent protein or luciferase. 

4 

92. The method of claim 58 wherein the step of transforming the host organism includes 
transforming a germ line cell of the host organism. 
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93. ^^otc^5B^^ of ^^^^ o ^ c ^ 
essentially of transforming a somatic cell of the host organism. 

94. 11. method of claim 58 wherem me step of 

essentially of transforming a gamete cell of the host organism. 
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